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D ' Abstract 
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The large majority of commercially available multiple-input multiple-output (MIMO) radio channel 
measurement devices (sounders) is based on time-division multiplexed switching (TDMS) of a single 
transmit/receive radio frequency chain into the elements of a transmit/receive antenna array. While being 
cost-effective, such a solution can cause significant measurement errors due to phase noise and frequency 
offset in the local oscillators. In this paper, we systematically analyze the resulting errors and show that, 
in practice, overestimation of channel capacity by several hundred percent can occur. Overestimation 
'nJ" ' is caused by phase noise (and to a lesser extent frequency offset) leading to an increase of the MIMO 

■ channel rank. Our analysis furthermore reveals that the impact of phase errors is, in general, most 

o . 

, pronounced if the physical channel has low rank (typical for line-of-sight or poor scattering scenarios). 

o 

The extreme case of a rank-1 physical channel is analyzed in detail. Finally, we present measurement 



C/3 



■ results obtained from a commercially employed TDMS-based MIMO channel sounder In the light of 

H : 

the findings of this paper, the results obtained through MIMO channel measurement campaigns using 
TDMS-based channel sounders should be interpreted with great care. 
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I. Introduction 

Multiple-input multiple-output (MIMO) wireless communication promises significant improve- 
ments over existing wireless systems both in terms of spectral efficiency and link reliability. 
Obtaining accurate measurements of MIMO radio channels is of key importance to devising 
accurate MIMO radio channel models, which in turn are vital for system design, simulation, and 
performance analysis. 

A common and widespread MIMO channel measurement device (a.k.a. sounder) design is 
based on time-division multiplexing with synchronous switching, or time-division multiplexed 
switching (TDMS) for short, of a single radio frequency (RF) chain into the individual elements 
of an antenna array. TDMS can be used at either the transmitter or the receiver {one-sided TDMS) 
or at both sides of the link {double- sided TDMS). For the latter case, which is practically the most 
relevant one, such an architecture is depicted in Fig.[Il TDMS constitutes a natural extension 
of single-input single-output (SISO) channel sounders and leads to very cost-effective solutions 
as only a single RF chain is required at either the transmitter or the receiver (one-sided TDMS) 
or at both sides of the link (double-sided TDMS). To the best of our knowledge, the large 
majority of commercially available MIMO channel sounders is based on the TDMS principle. A 
major drawback of TDMS-based sounder architectures results from temporal phase deviations 
between the outputs of the local oscillators (LOs) in the RF chains at transmitter and receiver 
being translated into the spatial domain due to switching across antenna elements. This can 
cause an increase of the MIMO channel rank and corresponding measurement errors, in terms 
of estimated MIMO channel capacity, that can be on the order of several hundred percent. It 
is therefore immediately clear that understanding the impact of phase errorjl] in TDMS-based 
sounding is of fundamental importance. 

One may argue that in a wireless communication link the impact of phase fluctuations in the 

'For brevity, in tlie remainder of this paper, we use the terminology phase errors whenever we refer to phase deviations due 
to phase noise, or frequency offset, or both. 



BAUM AND BOLCSKEI: INFORMATION-THEORETIC ANALYSIS OF MIMO CHANNEL SOUNDING 3 

transmitter and/or the receiver can simply be absorbed into an effective channel consisting of 
the physical propagation channel combined with LO-related (and potentially other) impairments. 
Channel estimation at the receiver for demodulation and decoding or for precoding at the 
transmitter (through feedback) would then simply work on the effective channel. This point 
of view can certainly be sensible in a data transmission setup if the frequency dispersion caused 
by phase fluctuations is small compared to that induced by the physical channel. In a channel 
sounding setup, however, it is crucial to separate the physical propagation channel (i.e., the 
object to be measured) from transmitter/receiver impairments, in order to obtain measurement 
results that depend as little as possible on the measurement device (sounder) used. Furthermore, 
as already pointed out, the measurement procedure employed in TDMS-based MIMO channel 
sounding results in very high sensitivity of the estimated channel capacity with respect to phase 
errors. 

Contributions: The goal of this paper is to systematically analyze the impact of phase noise 
and frequency offset (between transmitter and receiver LO) on estimated MIMO channel capacity 
when TDMS-based channel sounders are used. In particular, we show that the presence of phase 
errors can lead to significant overestimation of MIMO channel capacity. A sensitivity analysis 
reveals that, in certain cases, underestimation is possible as well, albeit typically resulting in 
significantly smaller errors. 

We start by devising a signal model, applicable to the wide class of correlation-based (as 
defined in [1, Sec. Ill]) MIMO channel sounders and taking into account phase noise and 
frequency offset. We then systematically identify situations where phase errors have no (or little) 
impact on MIMO (ergodic and outage) capacity estimates and where they lead to significant 
estimation errors. As an extreme case in the latter category, we demonstrate that even moderate 
phase noise can turn a rank-1 physical channel (e.g., a pin-hole channel [2], [3]) into a full- 
rank effective channel; analytic expressions for the corresponding (ergodic and outage) capacity 
estimates are provided. Our analytic results are supported by measurement results obtained from 



a commercially employed TDMS -based MIMO channel sounder. 

Previous related work: For linear time-varying SISO physical channels, an analysis of the 
systematic measurement errors incurred by correlative channel sounders (due to the time-varying 
nature of the physical channel) is reported in [1]. Models for the phase noise power spectral 
density (PSD) of a 5 GHz and a 50 GHz frequency synthesizer as well as an expression for 
the phase-noise induced reduction of the dynamic range of m-sequences can be found in [4]. A 
subsequent paper by the same authors [5] discusses the impact of frequency offset on direction 
of arrival (DOA) estimates in single-input multiple-output (SIMO) TDMS-based measurements 
and proposes corresponding mitigation methods. The effect of random- walk phase noise on the 
root mean-square (rms) error of SAGEnl-based DOA estimates is analyzed in [7]. 

For fully parallel MIMO channel sounders, i.e., channel sounders employing a separate RF 
chain for each transmit and each receive antenna element, the impact of gain imbalance in parallel 
RF chains and of thermal noise on the estimated capacity of a physical rank-1 MIMO channel 
is analyzed in [8]. The variance of an approximation of the error in the mutual information 
(MI) of an effective channel resulting from a deterministic MIMO channel subject to additive 
white complex Gaussian distributed perturbations is derived in [9]. For physical pin-hole [2], 
a.k.a. key-hole [3], [10], [11] (i.e., rank-1), MIMO channels in a controlled indoor environment, 
the impact of measurement imperfections such as thermal noise and "multi-path leakage" (i.e., 
multi-path components propagating between the transmitter and the receiver via paths other than 
through the pin-hole) on the channel eigenvalue distribution and the resulting outage capacity 
are analyzed numerically, based on measurements and simulations, in [11]. 

Organization of the paper: The remainder of this paper is organized as follows. In Section HIl 
the architecture of a TDMS-based MIMO channel sounder is described and the corresponding 
channel and signal model are provided. In Section Hill we analyze the effect of phase errors on 

^Space-alternating generalized expectation maximization (EM) algorithm [6]. 
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MIMO channel statistics. The corresponding impact on estimated MI is studied in Section |IVl A 
framework for analyzing the sensitivity of the MIMO channel MI to phase errors is developed in 
Section lYl Section |VI] is devoted to the special (but practically relevant) case of rank-1 physical 
channels. Measurement results performed on a typical, commercially employed TDMS -based 
MIMO channel sounder corroborating our analysis are presented in Section IVIIl We conclude 
in Section IVml 

Notation: Ej-} denotes the expectation operator. f{t) * g{t) = f{T)g{t — r) dr stands for 
the convolution of the functions f{t) and g{t). The Dirac delta function is denoted as 5{t), 
and 5i = 1 for i = and otherwise. The superscripts ^ , ^ , and * stand for transposition, 
conjugate transposition, and elementwise conjugation, respectively. An mx n matrix is a matrix 
with m rows and n columns. 1 and denote an all-ones and all-zeros matrix, respectively, of 
appropriate size. If required, the size of a matrix is specified through subscripts, e.g., lm,n- Im 
stands for the m x m identity matrix. A o B and A (g) B denote the Hadamard (pointwise) 
product and the Kronecker product, respectively, of the matrices A and B, and /°(A) stands 
for the matrix resulting from entry-wise application of the function /(■) to A. ||A||f is the 
Frobenius norm of A and Tr(A) is the trace of A. diag(x) denotes the diagonal matrix with the 
elements of the vector x on its main diagonal, and dg(A) = A o I zeros out all but the diagonal 
elements of A. The element in the mth row and nth column of A is denoted as [A]m_„. r(A) 
and Ai(A) stand for the rank and the ith eigenvalue of A, respectively. Unless explicitly stated 
otherwise, eigenvalues are sorted in decreasing order, i.e., Ai(A) > A2(A) > ■ ■ ■ > A„(A). 
For an m X n matrix A = [ai a2 ■ ■ ■ a„] with columns a^, we define the mn x 1 vector 
vec(A) = [af ag ■ ■ ■ a^]^- The commutation matrix K(m,n) [12, Sec. 3.7] is a permutation 
matrix of size mn x mn uniquely defined through 

K(„,„)vec(A) = vec(A^) (1) 
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where A is an m x n matrix. For brevity, we define divec(A) = diag(vec(A)). For two 

d d 

random variables (RVs) X and Y, X = Y and X Y stands for equivalence and approx- 
imate equivalence in distribution, respectively. For a RV X, the probability density function 
(pdf) and cumulative distribution function (cdf) are denoted by px(a;) and Yx{x), respectively, 
and the moment-generating function (MGF) is defined as Mx{s) — E{exp(sX)}. The vari- 
ance of a RV X is denoted as Var{X}. The covariance matrix of a complex random matrix 
X is defined as Cov{X} = E{(vec(X) - E{vec(X)}) (vec(X) - E{vec(X)})^} and the 
corresponding matrix of correlation coefficients is the matrix with entries [Corr{X}]^_„ = 

1 /2 

[Cov{X}]m^„/ ([Cov{X}]m,m[Cov{X}]„^„) ' . Thc pseudo-covariance matrix [13] of a complex 
random matrix X is defined as CoVp{X} = E{ (vec(X) -E{vec(X)}) (vec(X) -E{vec(X)})^}. 
For the integers a and b, adivfe = \p,/h\ denotes the integer division of a by h, and a mod 6 
stands for the remainder, on division of a by h. For a complex scalar 2; G C, the function Bxg{z) 
returns the argument (angle) of z in the interval [0, 27r), and Re(2;) and Im(^) stand for the real 
and imaginary part of z, respectively. All logarithms are to the base 2 unless explicitly stated 
otherwise. |3| denotes the length of an interval J. Throughout the paper, the number of transmit 
and receive antenna elements in a MIMO channel is denoted as Mr and Mr, respectively, and 
we will refer to antenna elements simply as antennas. 

A real Gaussian random vector is defined as a vector with jointly Gaussian (JG) elements and 
denoted by M{ra, C), where m is the mean and C is the covariance matrix. A complex Gaussian 
random vector is defined as a vector with JG real and imaginary parts. A complex random vector 
will be called proper if its pseudo-covariance matrix vanishes. CN{ra, C) stands for a proper 
complex Gaussian random vector with mean m and covariance matrix C. The complex random 
vectors x and y will be called jointly proper if the composite random vector having x and y as 
subvectors is proper. 

For a chi-square distributed RV with n degrees of freedom and variance 2na'^ we write ^2, 
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where xl,a^ = ||x|P with x = A/'(0, (7^I„). [xi X2 ■■■ x„] = Dn{ai,a2, . . . ,an), Xi > 
{i = 1, 2, . . . , n) denotes a Dirichlet distributed random vector with parameters ai, a2, . . . , a„. 
The corresponding subvector x — [xi X2 ■ ■ ■ Xn-i ] satisfies x = £)„_i(ai, 02, ... , a„_i; a„) 
and has joint pdf [14, Th. 1.2] 



Pr>„_i(ai,a2,.-,an-i;an)(^) ~ j-j-n r{a-) ^11"^*' ) ( ^ 



3Jj 



where a — Yl^=i(^i ^{^) is the Gamma function [15, Sec. 6.1]. We denote the beta 
distribution with parameters a and b as P{a,b) with pdf given by 

The digamma function '^{z) is defined as [15, Eq. 6.3.1, Eq. 6.3.16] 



^{z) = ^ iog,r(;.) = ^ = ^o(^), z e c 



oo 

Z 



(2) 



^,(z) = -7 + E TT' ^ ^ 0' ■ ■ ■ 

n(n + z — 1) 

n=l ^ ' 



where 7 fa 0.5772 is Euler's constant, and '\\)n{z) is the polygamma function [15, Eq. 6.4.1], 
defined as the nth derivative of '^{z). We will also need the following representation of the 
digamma function at positive integer multiples of 1/2 given by [15, Eq. 6.3.2, Eq. 6.3.4] 

*W = -7 + E^, *(^-^) =-7-21oge(2) + 5j^^, fceN,fc>l. (3) 

n=l ^ ^ n=l 

Finally, we note that the first derivative of the digamma function can be written as an infinite 
series as [15, Eq. 6.4.10] 

n=l ^ ' 
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II. MIMO Channel Sounding Based On Time-Division Multiplexed Switching 

In this section, we describe the system architecture of a correlation-based MIMO channel 
sounder employing TDMS and we present the corresponding signal model, taking into account 
the presence of phase errors. 

A. Channel Sounder Architecture 

The basic architecture of a TDMS-based MIMO channel sounder is depicted in Fig.[TJ A (pos- 
sibly complex) sounding signal x{t) is generated in baseband and modulated to the propagation 
channel center frequency. At the receiver, bandpass filtering (to remove out-of-band thermal 
noise) and downconversion to baseband, resulting in the signal u{t), is followed by extraction 
of the MIMO channel estimate. Both the transmitter and the receiver employ a multiplexing 
unit, which steps a single RF chain through all transmit/receive antennas sequentially in time 
following a prescribed switching pattern. Clocks at transmitter and receiver serve as reference 
for the analog-to-digital converter (ADC) and digital-to-analog converter (DAC) sampling rates, 
antenna multiplexing timing, and RF mixing (i.e., LO) frequencies. 

Another frequently used MIMO channel sounder setup employs a single antenna (either at the 
transmitter or the receiver or at both sides of the link), which is physically moved (in an automated 
fashion) to form a "virtual" antenna array. This setup also fits into the framework described 
in the paper. The impact of phase errors on MIMO channel capacity estimates in such a time- 
division-based "virtual" antenna array sounder architecture will, in general, be significantly more 
pronounced than in the TDMS-based case where a single RF chain is switched electronically 
into different physical antenna elements. This is because in the "virtual" antenna array case, the 
time that passes when moving the single antenna from a given physical position to the next one 
is much longer than the time it takes to switch electronically between different antenna elements. 
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B. SISO Signal Model 

We Start by presenting the signal model for a correlative SISO channel sounder in the presence 
of phase errors. This constitutes the basis for the MIMO signal model in TDMS-based sounders 
introduced in Section ITl-C[ 

Apart from frequency offset, a difference between the reference clocks at transmitter and 
receiver also causes a difference in the corresponding baseband sampling rates. The effect of 
this sampling rate offset, however, can be neglected compared to the frequency offset incurred by 
up- and down-conversion with respect to the LO frequencies, which are significantly higher than 
the sampling rates. The signal model presented below will therefore not account for sampling 
rate offset. 

1) Correlation-based sounding: The baseband sounding signal is given by x{t) = J2k -^(^ ~ 
tk) where s{t), supported on an inverval 3^, denotes the convolution of the (time-continuous) 
transmit sounding sequenced with the impulse response of the transmit frontend filter. Further, tk 
is the SISO snapshot (measurement) time instant, referred to as snapshot time in the following, 
and k denotes the SISO snapshot index. Because the sounder can have a measurement duty cycle 
less than one, the time intervals [tfc-i,tfc] generally contain measurement time and void time. In 
the following, we will refer to the quantities tfc — t/t-i as the SISO snapshot time distances. After 
bandpass filtering and downconversion, the receiver applies a linear time-invariant (LTI) filter 
with impulse response r(t), supported on an inverval Jr, and consisting of the receive sounding 
sequence convolved with the receive frontend filter. The signals s{t) and r{t) are chosen such 
that the function 

c{t) = s{t)*r{t) 



^The sounding sequence is a weighted chip-spaced Dirac train with the weights determined by the time-discrete sounding 
signal, e.g. a binary phase-shift keying (BPSK)-modulated m-sequence. 
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is peaky around t — 0, where it takes on its maximum value 

c(0) = / s{t)r{-t) dt = 1 (5) 

with Jtr = 3x1"! (— 3r). In the following, we will be interested in the behavior of the function 
c{t) in an interval 3c around t — 0. The sequence signal-to-noise ratio (SNR) quantifies the 
peakiness of c{t) (in an interval 3c). 

In the sequel, we denote channel sounders satisfying the conditions described in the previous 
paragraph as correlation(-based) sounders. In the literature, the term correlation sounder is 
usually reserved for a more specific setup where r{t) = s*{—t) with s{t) resulting from a 
BPSK modulated pseudo-(random) noise sequence (PNS) (typically an m-sequence). The widely 
used class of channel sounders employing chirp or multisine sounding sequences [1], [16]-[19] 
satisfies the conditions stated in the previous paragraph and hence fits into our framework. 

2) Modeling phase noise and frequency offset: The LOs in the transmitter and the receiver 
generate signals that have the analytic signal representations 

OT(t) = eJ(2-/Tt+VT(«)) and OR(t) = e^(2-/R*+^RW) 

respectively, where /t and /r are the corresponding desired LO frequencies, and (prit) and 
(pR{t) represent the corresponding additive LO phase. In the remainder of this paper, we assume 
that (pT{t) and (pK{t) are real- valued zero-mean wide-sense stationary (WSS) Gaussian random 
processes. We shall next justify the modeling assumptions made in this paragraph. 

The characterization of phase and frequency instabilities in precision frequency sources has 
been a major area of research for many years [20]-[26]. In the ensuing discussion, we distinguish 
between the directly observable random process o{t) — exp (^j (2,7: ft + (p{t))) and the underlying 
phase process (p{t). There are two fundamentally different models for (p{t) motivated by the 
corresponding different methods of frequency generation, namely free-running and closed-loop. 
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In the case of free-running oscillators, the phase process is often modeled as a continuous-time 
Wiener (a.k.a. Brownian motion) process, which is Gaussian and nonstationary, with autoco- 
variance function E{Lp(t)(p{t + r)} oc min(t, t + r) [23, Corollary 7.1]. It is important to note 
that even though the phase process is nonstationary and has a variance that grows without 
bounds over time, the corresponding observable process is stationary with finite power and has 
an approximately Lorentzian (i.e., one-pole lowpass) PSD function around the first harmonic 
[23, Eq. (41)]. In the case of a closed-loop (a.k.a. phase-feedback) system, such as a phase- 
locked loop (PLL), in the locked and steady state, the phase difference between its input and 
output signal can be modeled as a Gaussian WSS process with bounded variance. In practice, 
frequency generation is often performed by locking one or more PLLs to the output of a free- 
running oscillator, which results in an overall phase process ip{t) that is the sum of a Brownian 
motion process and an asymptotically (in time) Gaussian WSS process [25]. Generally speaking, 
the components in ip{t) corresponding to the free-running oscillator and to the PLLs dominate 
the long-term and the short-term phase process behavior, respectively. In TDMS-based MEMO 
channel sounding, the time-scale of interest, determined by the duration of one MIMO snapshot 
(i.e., the duration it takes to measure all scalar subchannels of the MIMO channel), is such that 
the behavior of the phase process is typically dominated by the component due to the PLL(s). The 
nonstationary (Brownian motion) component in Lp{t) can therefore be neglected. We shall support 
this statement in Section |VII] through measurementcl performed on a commercially employed 
TDMS-based MIMO channel sounder. 

The absolute frequencies generated by the independent clock sources at transmitter and receiver 
can differ significantly, which is accounted for by allowing a nonzero frequency offset Aco = 
27r(/T — /r). The resulting overall model, specified below, captures all relevant effects in a 

''while we did not observe tlie Brownian motion component within the duration of a MIMO snapshot in our measurements, 
we do acknowledge that it may occur for the channel sounder under consideration if the MIMO snapshot duration is sufficiently 
long. 
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simple and mathematically tractable manner. 

3) SISO signal model: We are now ready to state the SISO signal model including the effects 
of phase errors. Throughout the paper, we shall work in complex baseband. We assume that 
the (time-varying) physical channel to be measured is LTI during each snapshot interval and 
can change from snapshot to snapshot. The impulse response during the kth snapshot interval 
tk + '3t {k ^ is given by 

hk{t) = J2h!^6{t-Ti\ rt^rt^Wm^n (6) 
I 

and the individual snapshot intervals do not overlap. 
The overall signal model (in complex baseband notation) for the kth snapshot is hence given 

as 

where we set OT,bb(^) = OT(t) exp(-j27r/Tt) = exp{jcpT{t)) andoR,bb(^) = OR(t) exp(-j27r/Tt) 
= exp( — j{Au;t — (pR{t))), and n'{t) represents thermal noise at the receiver input. Straight- 
forward manipulations yield 

u,{t) = r{t) * ( y^-te-^^^^y^^^^'-^'^''\ s{t - tk - rf )) + n{t) (7) 

' 0k,l{t-tk-ri'^) 

where n{t) — r{t) * n'{t). In the absence of phase errors, i.e., (pT{t) — <fR{t) — Ao; = 0, we 
have 

Uk{t) = Yl <t -tk- ri'^) + n{t). (8) 

Now, if the interval 3c is such that r^''^ e O^c V A;, I, the peakiness of c{t) — s{t) * r{t) (within Jc) 
implies that the channel coefficients /i^'^ can be retrieved by sampling Ufc(t) at the time instants 
tk + Tk^- The measurement SNR in extracting the coefficients hf' from Uk{t) is clearly limited. 
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among other factors, by the sequence SNR. From ([7]), we can see that in the presence of phase 
errors, one has to deal with the quantity 

instead of c(t) as in ([8]). Phase errors, accounted for by the term 9k,i{t), therefore can lead to a 
shifting of the peak of c{t) (which is at t = in the phase-error free case) and to a degradation 
of the correlation properties, or equivalently, the peakiness of c{t) as quantified by the sequence 
SNR. We shall next show that, under quite general conditions, we can write 

c'kAt) = Ck,iPk,iit) (9) 

where Pk,iiO) = 1 V A;, /, and the Pk,i{t) are functions that have their peaks at t = 0. As a 
consequence of dH), we then obtain 

Ukit) = Ck,iVk,i{t -tk- 4'^) + n{t) (10) 

I 

which implies that the channel coefficients in the presence of phase errors are given by h^^^ Ck,i 
and can still be obtained by sampling Uk{t) at the time instants tk + r^'' ■ Moreover, the functions 
Pk,i{t) will be less peaky than c{t) (within the interval 3c), which results in a reduction of the 
sequence SNR and hence the measurement SNR in terms of extracting the channel coefficients 
by sampling the function Uk{t). 

A general but restrictive condition for the Pk,i{t), ^ k,l to have their peaks at t = is that 
Ok,iit), V/c,/ be narrowband relative to s{t) which would imply c^;(t) oc c(t). This "slow phase 
noise" condition, however, is in practice hardly satisfied. A more systematic approach to assessing 
the degradation of sequence SNR due to phase errors consists of decomposing 9k,i{t) into the 
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harmonics exp{jujdt) and evaluating the behavior of the quantity 

/CX) 
-oo 

which for r{t) = t) becomes Woodward's correlation function [27, Ch. 7, Eq. (17)] 

/oo 
-oo 

where s~ stands for s{—t). In the remainder of this paragraph, we restrict ourselves to r(t) = 
s*{—t), for simplicity. In the general case Woodward's correlation function is replaced by the 
cross-correlation function between s{t) and r{t). Denoting the Fourier transform of 9k,iit) by 

QkAM = I-oo^kAt)e'''''dt, it follows that 

1 

= ^ y QkAM As-it,u;) da;. 

Analyzing the quantity E{|c'^ ^(t)^}, it can be verified that for m-sequences (used, e.g., in the 
channel sounder analyzed in Section rvTIl ) Ag-{t,u!) is such that, under quite general conditions 
on the phase noise PSD, peak-shifting does not occur, i.e., c^;(t) has its peak at t = 0, V /c, Z, 
and hence dH) is satisfied. However, for sounders employing, for example, chirp sequences peak- 
shifting does occur. 

We shall next show that, under assumptions validated by our measurements in Section IVIIl 
^ can be simplified further in the sense that the c^ / do not depend on /. Straightforward 
manipulations reveal that 

where w{t) = s{t) r(—t) is a window function. For BPSK modulated m-sequences with rectan- 
gular chip pulses (used in the sounder analyzed in Section IVIIl) and r(t) = s*{—t), for example, 
we have w{t) = const, for t G 3tr- Next, assuming that 
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i) fRi^), ^Tit), and Acut do not change appreciably during an interval of length |2Jtr|, and 
thus, recalling the normalization of w{t) according to (|5]), 



/ e^^'^^ g-jVR(tfc+T^''+0 gjVT(tfc+€) ^^^^ 



(12) 



ii) the delay spread is small enough for 

Acurf^O and Mh + A'^ + - Mtk + 0, V A;, / (13) 

to hold, 

and further assuming without loss of generality (w.l.o.g.) that the interval Jtr is symmetric 
around t = 0, we obtain 

Ck,i ~ 6^'^"*'= e^'^'^ (14) 

which, obviously, does not depend on /. Here, is a real- valued WSS Gaussian random process 
with zero-mean and variance a^. We note that formally the condition that (^R{t) does not change 
appreciably during an interval of length PtrI implies the second condition in (fT3l) as the delay 
spread is significantly smaller than PtrI- We decided, however, to state the second condition 
in (fT3l) separately in order to stress that it is more critical that the approximation error in the 
second condition in (fT3l) be small. Inserting (fT4l) into (flOl) finally yields 



Uk 



(t) = Yl e^'^"*'' e^'^'= pkAt -tk- A'^) + Uk (15) 



which implies that the impact of phase noise and frequency offset is to modulate the (potentially 
frequency-selective) physical channel process according to 
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For flat-fading physical channels, considered in the remainder of the paper, we simply get 



where we set Hk = Acotk. We conclude by noting that even though the model (fT6l) is standard 
and well-known in the literature, we decided to provide a detailed derivation in order to exhibit 
the underlying key assumptions. In particular, as already mentioned, these assumptions will be 
validated in our measurements in Section IVIIl 

In practice, processing on the receive side is often implemented through periodic convolution 
rather than linear convolution, as described above. The corresponding class of sounders will be 
called circular convolution-based in the following and entails extending s(t) and r(t) periodically 
to result in the | Jp|-periodic signals s{t) and f(t), where \3p\ is given by the length of the 
sounding (transmit and receive) sequence. Consequently, 3p denotes one period of s(t) and f(t). 
The signals Uk{t), obtained by convolving, for each /, with f(t — rj:^) rather than r{t), are then 
sampled at the time instants t^. As compared to the case of linear convolution, this leads to a 
slightly modified expression for the coefficients Ck,i given by 



with w(t) = s{t)f(—t). Now, in order to arrive at the effective input-output relation (fT6l) . we 
need to slightly revise the assumptions made in the case of a linear convolution-based system. 
Instead of Condition i) leading to (fT2)) . we need to assume that v5R(t), v^t(^), and Aut do not 
change appreciably during an interval of length |3fp| and thus 



Instead of the second condition in (fT3l) . we need to assume that the delay spread is small enough 



(16) 




(17) 




(18) 
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for 



Mtk-rP +0-VT{tk + 0, ^kj (19) 

to hold. The condition ^ujt^^ is not needed. Circular convolution-based sounders have the 
advantage of reducing the required per SISO snapshot recording interval lengthO from |3fTR| + l^fd 
to pp|. The inequality |Jtr| + Pd > |3fp| is obtained by noting that in the case of circular 
convolution |Jtr| + \'^c\ = \'^t\ + \'^c\ > pp| + \'^c\, where we used r{t) = s*{—t) and the fact 
that the length of 3t is given by the length of the transmit sounding sequence plus the length of 
the transmit frontend filter impulse response. We close the discussion by noting that the channel 
sounder investigated in Section IVIIl is a circular convolution-based sounder. 

C. MIMO Signal Model 

The basic principle of TDMS-based MIMO channel sounding is to sequentially measure the 
MtMr scalar subchannels of the MIMO channel. Since the individual SISO subchannels (note 
that we continue to use the term SISO snapshot to refer to the measurement of a subchannel 
of the MIMO matrix) are band-limited stochastic processes (due to finite Doppler spread), it is 
not necessary to assume that the subchannels are static during the entire MIMO measurement 
period. Rather, it suffices to choose the sampling rate in compliance with the sampling theorem 
and to properly align the measurements in time [28]. In the remainder of the paper, we assume 
that this alignment has already been performed. 

Denoting the ejfective (i.e, the physical channel including the effect of phase errors) scalar 
subchannel between the nth (n = 1, 2, . . . , Mt) transmit and the mth (m = 1, 2, ... , Mr) receive 
antenna as hm,n = hm,n^^'9{j {l^m,n + V^m,n)), the corresponding effective MIMO channel matrix 

^The signal recorded is the signal at the output of the receive frontend filter. 
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can be expressed as 



H = Hoexp°(j(M + $)) 



(20) 







where [M]^_„ = „ and [*]m,n — Vm,n- What we would like to measure is the physi- 
cal channel matrix H. However, due to phase errors, the sounder has access to the effective 
channel matrix H only. The entries in M and $ depend on the switching pattern, i.e., the 
order in which the individual scalar subchannels are measured, and the SISO snapshot time 
distances. In the following, we denote a switching pattern as an ordered sequence of pairs 
((mi, ni), (m2, n2), . . . , (mMxMR, JT-MtMr)) where {mk,nk) means that the scalar subchannel 
h^j^^nk is being measured at time Let us consider a simple example with Mt = Mr = 2. 
The switching pattern ((1, 1), (2, 1), (2, 2), (1, 2)) leads to 



01 



Dj(M2+¥'2) pj(M3+¥'3) 



whereas the switching pattern ((1, 1), (2, 1), (1, 2), (2, 2)) results in 



0, 



The physical channel matrix H is, of course, unaffected by the switching pattern. The dependence 
of on the switching pattern and on the SISO snapshot time distances is highly problematic since 
different switching patterns and/or SISO snapshot times yield different (incorrect) measurement 
results for the same physical MIMO channel. The following simple example for Mr = Mr = 2 
illustrates this undesirable effect and its implications: Assume that the physical channel is given 
by H = 1, and jii + ^pi = ij^a+^a = 0, = — 7''/2, and /is + v's = tt/2. It is then easily seen 

that Ai((Ho0i)(Ho0i)«) = A2((Ho0i)(Ho0i)^) = 2 whereas Ai ((Ho 02) (Ho 02)^) =4 
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and A2 ((Ho02)(Ho02)^) = 0. In summary, starting from a rank-1 physical channel, depending 
on the switching pattern, we can get a rank-1 or a rank-2 effective channel. 

We conclude this section by introducing an approximation that will frequently be used through- 
out the paper. For small phase errors, we use the standard first-order Taylor-series approximation 
(see, e.g., [29, Eq. (4.12)]) 

exp°(j$) f« 1 (21) 

In the remainder of the paper, whenever referring to H, unless explicitly stated otherwise, we 
shall use the exact expression for according to (|20|) . We conclude this section by noting that, 
throughout the paper, whenever we deal with random physical channels, will be assumed to 
be statistically independent of H. 

III. Effect of Phase Errors on MIMO Channel Statistics 

In this section, we describe the impact of phase errors on MIMO channel statistics (i.e., the 
statistics of the effective MIMO channel H vs. the statistics of the physical MIMO channel H) 
thereby laying the foundations for the results in Sections |IVl |VIl and IVIII 

In the following, we consider both deterministic and stochastic physical channels H. For 
deterministic H, phase noise induces randomness and hence makes the static channel appear 
fading. In the case of stochastic H, phase errors alter the channel statistics. 

A. Mean and Covariance of Effective MIMO Channel Matrix 

We start by investigating the impact of phase errors on the mean Hf = E{H} and on the 
covariance Cov{H} of the physical channel. The developments in the sequel apply to both 
deterministic physical channels, where Hf = H and Cov{H} = 0, and random physical 
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channels. Using (|20l) . a straightforward calculation reveals that 

E{H} = v^exp°(jM) o Hf 
Cov{H} = K (mm^) o exp°(Cov{$}) o (^vec(Hf) (vec(Hf))^+ Cov{H}) (22) 

- vec(Hf) (vec(Hf))^ 

where k = exp(— o"^), m = vec(exp°(jM)), and we made use of the relations vec(A o 
B) (vec(AoB))^= (vec(A) (vec(A))^)o(vec(B) (vec(B))^) and E{exp(jX)} = exp{jmx- 
al/2) forX~Ar(mx,ai). 

We observe that phase noise leads to an attenuation of the first moment of the physical channel 
by a factor of The presence of a frequency offset (reflected by the matrix M) can result in 
E{H} having a higher rank than Hf which in turn implies that the spatial multiplexing gain of 
the effective channel can be higher than that of the underlying physical channel. Take for example 
a deterministic rank-1 physical channel with H = Hf = 1. In the absence of phase noise, we 
have E{H} = exp°(jM) which, depending on the frequency offset characteristics, can even 
have full rank. As a simple example, consider the switching pattern ((1, 1), (2, 1), (2, 2), (1, 2)) 
with = exp(j7rfc/2), which results in the full-rank matrix 



E{H} = exp°(jM) 



j 1 
-1 -J 



The conditions for frequency offset (in terms of the measurement setup) to have a significant 
impact on the measurement error in terms of MI will be discussed in Section ITV- A3 [ 

The impact of phase errors on the channel's second-order statistics is more involved. Consider, 
for example, the case of no frequency offset (M = and hence m = 1) and fully correlated 
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phase noise, i.e., Cov{^»} = a^l, representative of a high-quality LO. In this case, (|22l) yields 



Cov{H} = Cov{H} + vec(Hf) (vec(Hf))^ (1 - k) 



which shows that the mere presence of phase noise, even if it is fully correlated, alters the 
covariance matrix of the physical channel by adding a rank-1 component to Cov{H}. For a 
deterministic physical channel, where Cov{H} = 0, we can see that the presence of phase 
noise randomizes the channel and yields an effective channel with the rank-1 covariance matrix 
Cov{H} = vec(Hf) (vec(Hf))^ (1 — k). We conclude this discussion by investigating the case 
of fully uncorrected phase noise, representative of a very poor LO, still assuming M = 0. In 



this case, we have $ = cr^I and consequently 



Cov{H} 



dg(vec(Hf)(vec(Hf)) 



H 



1 K K 



\ 



K 1 K 



Cov{H}. (23) 



y K K, K ■ ■ ■ 1 J 

We can see that again for deterministic physical channels (where Cov{H} = 0) the effective 
channel has a nonzero second moment and is hence randomized by phase noise. An important 
effect is brought out by starting with a purely Rayleigh fading physical channel (i.e., Hf = 0) with 
fully correlated entries (due to insufficient antenna spacing for example) so that Cov{H} = 1. If 
the phase noise variance is high so that k = exp(— a^) is small, (|23l) implies that Cov{H} ^ I 
which amounts to a full decorrelation of the channel entries. Consequently, the effective MIMO 
channel "looks like" an independent and identically distributed (i.i.d.) channel. There is one 
subtle difference, though, to the widely used i.i.d. Rayleigh fading channel model, namely 
that the effective MIMO channel matrix will not have JG entries (see Section ITII-B|) . We can 
therefore not conclude that the entries in the effective channel matrix are statistically independent. 
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Nevertheless, the statistics of the effective channel will cause significant overestimation of the 
channel's MI and capacity (see, for example, the numerical results in Section ITV-BI) . In the case 
of fully correlated phase noise, the effective MIMO channel matrix will have JG entries and, as 
discussed above, we have (assuming Hf = 0) Cov{H} = Cov{H} so that overestimation does 
not occur. 

We close this discussion by noting that we have identified a number of possible scenarios where 
the presence of phase errors can significantly alter the MIMO channel statistics and, in particular, 
can lead to substantial rank increase of the effective channel's covariance matrix with respect 
to (w.r.t.) the underlying physical channel's covariance matrix. From a channel measurement 
point of view, the consequences are significant measurement errors in terms of MI and capacity. 
Corresponding quantitative results (analytic and numerical) will be provided in Sections |W] and 
rvTl Finally, we recall that the situation is exacerbated by the fact that measurement errors depend 
significantly on the antenna switching pattern as well as the SISO snapshot time distances. 

B. Loss of (Joint and Individual) Gaussianity 

The impact of phase noise on the MIMO channel statistics is not restricted to the first and 
second moments as discussed above. Rather, as shown below, it affects the joint and individual 
distributions of the scalar subchannels in a profound way. In the following, for the sake of clarity 
of exposition, we set M = 0. 

1 ) Effect on the joint distribution of the Rayleigh fading physical channel elements: We start 
by assuming that the physical channel is Rayleigh distributed, i.e., Hf = E{H} = and H is a 
zero-mean proper (or equivalently "circularly symmetric") complex Gaussian random matrix. We 
shall next show that even though the individual entries of H continue to be circularly-symmetric 
complex Gaussian, they will, in general, not be JG. The first part of this statement is made 
precise in the following Lemma for which, even though it is straightforward, we could not find 
a reference in the literature. 
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Lemma 1: Let h = CJ\f{0, af) and take to be a continuous RV with pdf p<^(x). The RV 
hexp{j(p) is CA/'(0, cr^), irrespective of Pip{x). 

Proof: Since \exp{jip) | = 1, it follows immediately that the pdf of \hexp{jip) \ is equal to the 
pdf of The statistics of the phase of hex.p{jip) are obtained by noting that arg(/iexp(j(^)) = 
(arg(/i,) + (^) mod 27r — z with arg(/i) being uniformly distributed in [0, 27r). Our task is therefore 
reduced to finding the pdf Pz{x). Next, denote the 27r-periodic continuation of the pdf of 
arg(/i) and of as Pf^,{x) = 1/(2%) and p^{x) = Yl^-ooPfi^ ~ ^tt/), respectively. Noting 
that arg(/i) and (p are statistically independent, we obtain Pz{x) — f^^Ph{x)p^{y — x)dx — 
l/{27i)j:^My-x)dx = 1/(271). U 

It remains to show that the elements in H will, in general, not be JG. This will be done by 
considering a simple example. Since two complex Gaussian RVs xi and X2 are JG if and only 
if the linear combination axi + bx2 is (complex) Gaussian V {a, b} E C, it suffices to show that 
z — hi exp(j(/?i) + /i2 exp(j(^2) with hi, h2 = CA/'(0, 1) will not be Gaussian if hi and /i2 are 
fully correlated, i.e., hi — /i2 and hence z — hiy where y — exp{j(pi) + exp(jip2)- A simple 
proof is obtained by writing the MGF of z as Mz{s) = E{exp{sz)} = EyE/u|y{exp(s/ii7/)} = 
Ej,{exp(|yp|sp/4)} which is the MGF of a complex Gaussian RV only if \y\ is deterministic 
(which is satisfied for ^pi and (^2 fully correlated or both being deterministic, but not in general). 

2) Effect on the joint distribution of the Ricean fading physical channel elements: Next, we 
consider a Ricean fading physical channel according to H = CA/'(Hf, Sh), where Hf ^ 0. We 
shall show that the presence of phase noise results not only in a loss of joint Gaussianity of the 
elements of H, as in the Rayleigh fading case, but also in a loss of propemess and Gaussianity 
of the individual entries in H. The loss of propemess follows by noting that 

CoVp{A} = E{(/i - E{/i})2} = E{/i2} _ (E{/i})2 = E{/i2 - {^^^hff 
= hW -Kh) = hl{K^ - 7^ 
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for cr^ > and hence /t < 1. It follows immediately that the elements of H are no more jointly 
proper as well. The loss of Gaussianity is a direct consequence of hf e^*^ not being Gaussian 
distributed. Since the marginals are not Gaussian, we can immediately conclude that the elements 
of H will not be JG either. 

3) Special cases: We conclude this discussion by dropping the assumption M = and 
identifying two interesting special cases where, despite the presence of phase errors, the effective 
MIMO channel matrix has JG entries. 

The first case is that of a deterministic physical channel H = Hf subject to "small" (in the 
sense of approximation (|2T)) being appropriate) phase noise. More specifically, we have 

H = Hf o exp°(jM) o (1 + 

Since the entries in $ are samples of a zero-mean (real) Gaussian process, any finite set of such 
samples is zero-mean JG which results in the entries of H being JG. The corresponding first 
and second moments are given by 

E{H} = Hf oexp°(jM) 
Cov{H} = (mm^) o (vec(Hf) (vec(Hf))^) o Cov{$}. (24) 

Note that while the entries in H are JG, they will neither be jointly proper nor individually 
proper in general so that the second-order description in (l24l) is incomplete without specifying 
the pseudo-covariance matrix obtained as 

CoVp{H} = -(mm^) o (vec(Hf) (vec(Hf))^) o Cov{$}. 

The second special case is that of an i.i.d. purely Rayleigh fading physical channel H. 
Defining h = vec(H) = CA/'(0,I), we want to show that Dh = CA/'(0,I), where D = 
diag([exp(j(;Ui + v?i)) exp{j{fi2 + '^2)) ■■■ exp(j(/iMTMR + V5AfTAfR)) ])• We start by noting 
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that the characteristic function of h is given by [30, Eq. (20)] 



= e 



The characteristic function of Dh is obtained as 



ED{e 




= e 



where we made use of the fact that DD = I. We have therefore shown that i.i.d. purely 
Rayleigh fading physical channels are not affected by phase errors. 

In closing, we would like to remark that the effective channel not having JG entries, in general, 
is one of the main factors contributing to the difficulties in analyzing the impact of phase errors 
on MI. 



Having seen in the previous section that phase errors can alter the MIMO channel statistics 
significantly, the purpose of this section is to analyze the corresponding impact on MI for random 
physical channels. Analytic results for the general (arbitrary rank (with probability (w.p.) 1) of the 
physical channel) case seem very difficult to obtain. We shall therefore restrict our discussion 
to identifying cases where the MI is not affected even though the channel statistics are. In 
addition, representative numerical results bringing out the key consequences of phase errors 
will be provided. Analytic results for deterministic physical channels and for (deterministic or 
random) rank-1 physical channels will be provided in Sections |V] and |VIl respectively. 

We analyze a MIMO channel with input-output relation 



IV. Effect of Phase Errors on Mutual Information 



r = Hs + n 



where s is the Mt x 1 transmit vector, r is the Mr x 1 receive vector, and n is an Mr x 1 



26 

noise vector distributed as CA/'(0,Imr)- Assuming no channel state information (CSI) at the 
transmitter and perfect CSI at the receiver, the MI (in bit/s/Hz) of this channel is given by [31] 



where the input signal vector was assumed to be circularly-symmetric complex Gaussian with 
covariance matrix (p/Mx) I a/t ^i^d P is the average SNR at each of the receive antennas. 

The purpose of this section is to study how I changes when H in (1251) is replaced by H in (l20l) . 
i.e., to analyze the statistics of / = logdet(lAfj,+(p/MT)HH^) = logdet(lMT + (p/^T)H^H). 

A. Cases Where Mutual Information is Not Affected 

In the last paragraph of Section |III1 we showed that i.i.d. Rayleigh fading channels are not 
affected by phase errors in the sense that the effective channel H is i.i.d. Rayleigh fading as well, 
irrespectively of the statistics of the phase errors. We have furthermore seen (Section [ill- Al) that 
for correlated Rayleigh or Ricean fading physical channels, phase errors can have a significant 
impact on the channel statistics and thus on the corresponding MI. There are cases, however, 
where even though the statistics of the effective channel H differ from the statistics of H, we 
still have 1 = 1. Intuitively, this happens because of the quadratic dependence of / on H. In the 
following, we shall discuss two such practically very relevant cases. 

1) The low-SNR regime: Phase errors have no impact on MI in the low-SNR regime, irre- 
spectively of the physical channel's statistics and the phase error statistics. This can easily be 
seen by noting that for low SNR 



which, combined with the fact that ||H||| = ||H|||, proves the statement. In general, we can 
conclude that the impact of phase errors on MI is more pronounced for higher SNR. This is 
because phase errors lead to a rank increase of the MIMO channel and high-SNR MI depends 




(25) 
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Strongly on the rank (or multiplexing gain) of the channel, whereas low-SNR MI depends only 
on the Frobenius norm ||H||f. 

2) One-sided switching or fully parallel sounding: For MIMO channel sounders where either 
the transmitter or the receiver employs one RF chain per antenna (i.e., parallel sounding) and 
hence no switching is necessary on the corresponding side of the link, the effective channel 
matrix is given by H = DrH^t and H = dRHDx in the case of switching only at the receive 
and the transmit side, respectively. Here, and rfx as well as the entries of the diagonal matrices 
Dr and Dt are of the form exp(j(/i + ip)). Even though (f22)) implies that the statistics of the 
effective channel H are different from the statistics of the physical channel H, it is easily seen 
by direct insertion into (|25l) that the MI is not affected by "one-sided" phase errors. Obviously, 
this is also true for fully parallel (both the transmitter and the receiver employ one RF chain per 
antenna) MIMO channel sounders. 

We would like to add a word of caution. In practice, the MI of measured MIMO channels is 
often not evaluated by directly inserting the measured channel realizations into the MI formula 
(|25l) . Rather, the measurement results are used to extract parameters of a statistical MIMO 
channel model, e.g., the power-angular spectrum (R\S) or the distribution of the DOA. The 
resulting statistical description of H is then used to evaluate the channel's MI. Now, one-sided 
switching does, in general, entail errors in the estimation of the parameters of a statistical MIMO 
channel model so that the procedure described above will lead to, in general significant, errors in 
MI. This brings out an interesting and practically very relevant point. While one-sided switching 
does not entail errors in MI when the measurement results are used to directly evaluate the MI, 
significant errors can be expected if one takes a detour via a specific statistical MIMO channel 
model. 

3) Impact of frequency offset for separable timing matrix: First, we define the timing matrix 
[T]mfe,nfe = tk of size Mr X Mt, which contains the SISO snapshot measurement times in matrix 
form arranged corresponding to the switching pattern. We call T separable if it can be written 
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in the form [T]m,n = [^K]m + [^tIu, where tx and are the transmit and receive timing vectors 
of size Mt X 1 and Mr x 1, respectively. An example of a switching pattern and SISO snapshot 
times leading to a separable timing matrix is given by the regular sounding pattern (where 
sounding pattern denotes the combination of a switching pattern and a set of SISO snapshot 
times) 

^fc = ~ 1) niod Mr + 1 

nfc = (A;-l)divMR + l (26) 

tk = T^iruk - 1) + TT(nfc - 1) 

with k = 1, 2, . . . , MtMr, characterized by the timing parameters Tt,Tr G This corre- 
sponds to starting with transmit antenna 1, switching through the receive antennas 1, 2, ... , Mr 
sequentially, then switching to transmit antenna 2, again switching through the receive antennas 
sequentially with the same SISO snapshot time distances between the receive antennas as before, 
and so on. Likewise, if we start with receive antenna 1, switch through the transmit antennas 
sequentially and so on, a separable timing matrix will be obtained. 

Now, for [T]^,„ = [tR]^ + [tT]„, we have exp°(jM) = exp°(jAa;tR) (exp°(jAu;tT))^ and 
hence by Theorem |4] in Section |Vll it follows that the eigenvalues of H o exp°(j(M + $)) are 
equal to the eigenvalues of Hoexp°(j$). This implies that for T separable, the frequency offset 
has no effect on MI. In practice, one typically has control over the sounding pattern. In the light 
of what was said above, it is therefore sensible to choose the sounding pattern such that T is 
separable. Of course, even if MI is not affected, frequency offset can still cause significant errors 
in other parameters, as discussed in the last paragraph of Section irV-A2[ We note, however, that 
in contrast to phase noise, frequency offset, due to its deterministic nature, can be estimated and 
mitigated with relative ease [5]. In the remainder of the paper, we shall, therefore, often neglect 
the frequency offset and consider H o exp°(j$) only. 
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B. Numerical Results 

We shall next provide numerical results to quantify the impact of phase errors on MI. In 
particular, we will also quantify the impact of the scalar subchannels in the effective MIMO 
channel not being JG distributed. 

1) Ergodic capacity increase due to phase errors: We examine an Mt = Mr = M = 8 
physical channel (with receive antenna correlation) given by H = R^/^H„, where R = R^/^R^/^ 
denotes the receive correlation matrix and the entries of H„ are i.i.d. CA/'(0, 1). We choose R 
such that Ai(R) = M/r(R) for i = 1, 2, ... , r(R) and Aj(R) = otherwise. This normalization 
ensures that Tr(R) = M, irrespectively of the rank of R. Employing Monte Carlo simulations, 
Fig. [2] shows the ergodic capacity ^ C = IE{/} of the effective channel H as a function of SNR for 
varying r(R), varying with fully uncorrected (i.e., worst-case) phase noise, i.e., Cov{$} = 
cr^ ImtA/r. and no frequency offset. We have chosen 3.5° and 7°rms phase noise (corresponding to 
cr^ ~ 0.0037 and cr^ ~ 0.0149, respectively) as typical and worst case values, respectively. These 
values were derived from our measurements on a commercially employed TDMS-based MIMO 
channel sounder (see Section rvlll) . In general, phase noise correlation properties quantified by 
Cov{$} and phase noise variance cr^ depend on the LO characteristics, SISO snapshot times, 
and the number of transmit and receive antennas. Fully uncorrelated phase noise, as assumed 
in this example, corresponds to relatively long SISO snapshot time distances and represents the 
worst (though not necessarily untypical) conditions. We can see in Fig.[2l that in none of the 
considered cases phase noise results in a reduction of ergodic capacity. Moreover, we observe, 
in agreement with what was shown in Section lTlI-BSl that in the case of i.i.d. physical channels 
(i.e., r(R) = 8 in this examplo phase noise has no impact at all on ergodic capacity. In the 
r(R) = 1 case, at p = 35 dB for the typical rms phase noise value of 3.5°, the error in ergodic 

^Throughout the paper, we tacitly assume that the effective channel is ergodic. 

'Recall that the nonzero eigenvalues of R were chosen to be equal, which together with r(R) — 8 implies that the entries of 
H = Ri/2jj^ i i ^_y^(o, 1). 
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capacity due to phase noise is about 100%. For the worst-case value of 7°rms phase noise, the 
error, again at p = 35 dB, is about 175%. Furthermore, for r(R) = 1 and rms phase noise of 
3.5°, the multiplexing gain of the effective channel (i.e., the ergodic capacity pre-log, obtained by 
determining the high-SNR slope of capacity as a function of SNR) lies between 4 and 8, making 
the effective channel look like a MIMO channel with rank between 4 and 8. Furthermore, as 
predicted by theory, we can see that phase noise has little impact in the low-SNR regime. We 
conclude this simulation example by noting that the impact of phase noise is most pronounced 
for low-rank physical channels at high SNR. This is one of the motivating factors for the detailed 
analysis of the rank-1 physical channel in Section IVTl 

2) Impact of loss of joint Gaussianity: As demonstrated in Section ITlI-B I albeit the individual 
elements of the effective MIMO channel associated with a physical purely Rayleigh fading 
channel will be circularly- symmetric complex Gaussian distributed, they will, in general, not 
be JG distributed. Characterizing the consequences of this loss of joint Gaussianity analytically 
seems very difficult as known techniques (e.g., [32]-[37]) for deriving analytic expressions for 
ergodic and outage capacity of MIMO channels (or bounds thereon) almost exclusively hinge 
on the assumption of the elements in H being JG distributed. There are two practically relevant 
points related to the loss of joint Gaussianity, which will be brought out through numerical 
results in the following. 

We investigate a 4 x 4 purely Rayleigh fading (i.e., Hf = 0) physical channel with two 
different correlation levels according to [Corr{H}]m,n = 0.7 and 0.95, respectively, iox m ^ n 
in the presence of fully uncorrected 7° rms phase noise and no frequency offset. For p = 20 dB, 
Fig. [3] shows the cdf of MI for the physical channel H, the resulting effective channel H, and a 
synthetic MIMO channel obtained by assuming that the channel matrix is circularly-symmetric 
complex Gaussian with covariance matrix according to (l23l) . i.e., [Cov{H}]m^„ = 0.7k and 0.95k, 
respectively, for m 7^ n. In addition, the cdf of MI under the linear phase noise approximation 
((2T)) is shown. Before proceeding, we note that the synthetic channel has the same first and 
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second order statistics as the effective channel and should hence exhibit (when compared to the 
effective channel) the impact of the loss of joint Gaussianity due to phase errors. 
We can now draw the following conclusions for this specific setup: 

• The impact of phase noise is significant at the (high) correlation level of 0.95. In addition, 
comparing the synthetic channel to the effective channel, we can see that neglecting the loss 
of joint Gaussianity would result in a slight overestimation and a significant underestimation 
of outage capacity at low and high outage levels, respectively. For outage levels of practical 
interest, i.e., up to 30% outage probability, the error in outage capacity (due to the loss of 
joint Gaussianity) is not more than 8%. 

• The impact of phase noise is negligible at the correlation level of 0.7. The synthetic channel 
exhibits a behavior that is very close to that of the effective channel. Interestingly, a 
correlation level of 0.7-0.75 is often quoted in the literature [38], [39] as a threshold above 
which "the channel starts behaving as highly correlated". 

• Even for the worst case rms phase noise value of 7°, the linear phase noise approximation 
(|2T|) yields very accurate results. This approximation will be used extensively in Section [VTl 

With regards to Ricean fading physical channels, we content ourselves with noting that for 
high Ricean K-factors, the situation approaches that for deterministic physical channels (studied 
in Sections |V] and [VIl) . whereas for low Ricean K-factors, the behavior will be close to that of 
the purely Rayleigh fading case (treated in this section and in Section EH). 

V. Sensitivity Analysis 

Throughout this section, we assume a separable timing matrix in which case frequency offset 
has no effect on MI and can therefore be neglected. For the sake of simplicity of exposition, we 
shall furthermore assume Mr < Mp. The purpose of this section is twofold. First, we introduce 
a tool for evaluating the sensitivity of the MI of a fixed physical channel to phase noise. This 
will be accomplished by computing the first two terms in the Taylor series expansion of 
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around the phase-noise-free case $ = 0. Based on this framework, we will then be able to 
provide analytic expressions for approximations of the first and second moment of the MI of 
the effective channel for arbitrary phase noise covariance matrix. 

We shall be concerned with computing the second-order Taylor series expansion of 



where J7}(0) denotes the 1 x MtMr Jacobian matrix (vector, in this case) and 7i/(0) is the 
MtMr X MtMr Hessian matrix of / = /($) at $ = 0. Clearly, we have /(O) = /. We shall see 
below that the second-order Taylor series expansion of MI is, in general, accurate for full-rank 
physical channels, but tends to yield loose approximations for rank-deficient physical channels. 
This problem can be mitigated by either using more terms in the Taylor series expansion of 
MI or by performing the Taylor series expansion on the channel's eigenvalues rather than on 
the channel's MI itself. Both approaches are, in general, cumbersome. The latter approach will 
not be detailed here, for brevity of exposition, but will be outlined briefly in Section |^Cl Even 
though the second-order Taylor series expansion of MI does not yield accurate approximations in 
the case of rank-deficient physical channels, explicit expressions for the Jacobian matrix j7/(0) 
and the Hessian matrix 7i/(0) can be used to test whether has an extremum at $ = 0. 

A. Sensitivity Analysis 

Even though the computation of the Jacobian matrix J}(0) and the Hessian matrix ?i/(0) 
does not pose any major technical difficulties, it still requires the application of tools that are 
not completely standard, namely matrix differential calculus [12] and matrix- variate Wirtinger 
a.k.a. CM calculus as described in [40]. We shall therefore present the corresponding derivations 



/(*) = log det I,,, + (H o exp°(j$)) (H o exp°(j$)) 




around $ = given by 




(27) 
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in some detail. 

Theorem 1: The 1 x MtMr Jacobian matrix (vector) J7}(0) in (|27|) is given by 

j7/(0) =21og(e)Im(^(^vec((Y-iH)oH*))^^ (28) 



where 



Mt 



The MtMr X MtMr Hessian matrix Hj{0) in ^ is given by 

HjiO) = 21og(e) Re(^divec(H)K(MT,AfH) ((H^Y-^)^ ® H^Y-i)divec(H) 

+ divec(H) ((Imt - H^Y-^H) ® (Y-i)^)divec(H*) (29) 
-divec((Y-iH)oH*)^. 

Proof: We start by defining 

Y = ^Im, + (H o exp°(j$)) (H o exp°(j$))'' 

P V V ' 

H H^^ 

SO that Y = Y|#=o and hence / = logdet((p/MT)Y). The strategy used in the proof of both 
parts of the statement is to compute dJ and d^J and to bring the resulting expressions into the 
form 

dl = A vec(d$) 

d^J = (vec(d$))^Bvec(d*) (30) 
which will then allow us to apply the first [12, Ch. 5, Th. 6] and the second [12, Ch. 6, Th. 6] 
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identification theorem for a real-valued function of real-valued parameters to conclude that 

J;(0) = A and HjiO) = i(B + B^). 

Computing the Jacobian matrix: Using the basic rules of differentiation together with the 
relation dlogedet(A) = Tr(A-MA) [12, Sec. 8.3, Eq. (11)], we obtain 

dlogdet((p/MT)Y) = log(e) Tr (|y-My) . 

Applying the product rule d(AB) = (dA)B+ AdB [12, Sec. 8.2, Eq. (15)] and d(A^) = (dA)'^ 
[12, Sec. 8.2, Eq. (18)], we get, using Wirtinger calculus, 

dY = d(HH^) = (dH) + H (d(H*))^. 

Withd(AoB) = (dA)oB+Ao(dB) [12, Sec. 8.2, Eq. (17)], we have dH = d(Hoe) = HodG 
and d(H*) = H* o d(e*). Noting that [d(/°(X))]^,„ = d([/°(X)]^,„) = d(/([X]„,„)), we 
obtain 

d log det ((p/Mt) Y) = log(e) Tr ^Y'^ jH (H o d$) ^ - ^ (H o d$) H^) ^ 

= log(e) Tr (y-^ Qh (H o d$) ^ - j {w (H o d$) ^j* ^ ^ 
= 21og(e) Im (^Tr(^Y-^H(Hod$)^)) 

= 21og(e)Im(^Tr(^(Y"iH)^(H*od$))) . (31) 
It remains to turn (|3TI) into the form dJ = A vec(d$). This can be done by first showing that 

Tr(A(B o O) = (vec(A^ o B))^vec(C) (32) 
and then applying (|32l) to (131]) . In order to prove (|32l) . we start by noting that with Tr(A^B) 



BAUM AND BOLCSKEI; INFORMATION-THEORETIC ANALYSIS OF MIMO CHANNEL SOUNDEMG 35 

= vec(A)^vec(B) [12, Sec. 2.4, Eq. (4)], we have 

Tr(A(B o O) = (vec(A^))^vec(B o C) 

which upon application of 

vec(A o B) = divec(A) vec(B) (33) 
yields the desired result. Finally, applying (|32] ) to (|3T1) . we obtain 

dlogdet((p/MT)Y) = 21og(e) Im|^(^vec((Y~^H) o H*))^^ vec(d$) 

which proves (l28l) . 

Computing the Hessian matrix: We start by noting that 

d2logdet((p/MT)Y) = log(e) dTr(Y-i dY) = log(e) Tr(Y-^ (d^Y) - (Y'^ dY)^) (34) 

where we used dTr(A) = Tr(dA) [12, Sec. 8.2, Eq. (20)] and d(A-i) = -A-^{dA)A-^ [12, 
Sec. 8.4, Eq. (1)] along with the product rule. In order to keep the following exposition simple, 
we set dH = jHod$ = jH so that dY = d(HH'^) = jH — jH H-^. Expanding the second 
term on the right-hand side (RHS) of (|34|) through similar manipulations as in the derivation of 
the Jacobian matrix, and using (Y^^)^ = (Y~^)*, we get 

-Tr((Y-MY)2) 

= Tr(Y-i(HH^ - HH^)Y-i(HH^ - HH^)) 

= Tr(Y-iHH^Y"^HH^) - Tr(Y"iHH^Y"^HH^) 

+ Tr (H* (Y-^)*H* (Y^^)*) - Tr (H* (Y-^)*H* (Y^^)*) 

= Tr(Y-iHH^Y-^HH^) - Tr(Y-iHH^Y-^HH^) 
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+ Tr((Y-^)*H* (Y-^)*H* H^) - Tr((Y-i)*H* (Y-^yW H^) 

= 2 Re (^Tr(Y"^ HH^ Y"^ HH^)) - 2 Re (^Tr(Y-^ HH^ Y"^ HH^)) 

= 2Re(^Tr(H^Y-iHH^Y-iH)) - 2 Re (^Tr ((Y-i)^H*H^ (Y-i)*H*H^)) . (35) 

Next, applying [12, Ch. 2, Th. 3] 

Tr(ABCD) = (vec(D^))^(C^ ® A) vec(B) = (vec(D))'^(A ® C^) vec(B^) (36) 

with A = B.^Y-\ B = H, C = B.^ Y \ and D = H to the first term on the RHS of ^ 
and with A = (Y-^)^, B = H*, C = (Y-i)*H*, and D = to the second term on the 
RHS of (135]), we obtain 

-Tr((Y-MY)2) = 2Re((vec(H^))'^Rivec(H)) - 2 Re(^(vec(H^))^R2 vec(H^)) 

= 2 (vec((d$)^))\e(divec(H^) Ri divec(H)) vec(d$) 

^ (37) 
- 2 (^vec((d$)^)) Re(divec(H^) Ra divec(H^)) vec((d$)^) 

where we set Ri = (H^ Y^^)^ ® Y^^ = (Y^^ ® H*)^(H* ® Y^^) and Rs = (Y^^)^ ® 
jj// Y^i H, and we used H = H o d$ and (l33l) in (a). Next, we need to rewrite (l37l) in terms 
of (vec(d$))^ and vec(d$) only, which requires getting rid of the terms (d$)^ inside the 
(vec(-))^. Upon applying ([B in (|37l ). we obtain 

-Tr((Y~MY)2) = 2 (vec(d$))^Kj;^^^ j^,^^)Re(divec(H^) Ri divec(H)) vec(d$) 

- 2 (vec(d$))^KfA,^^A,^)Re(divec(H^) R2 divec(H^))K(A^^,MT)vec(d#) 
which using K[^ „) = K^n,m) [12, Sec. 3.7, Eq. (2)] and 

(vec(A))^K(„,^) divec(B) = (vec(A))^divec(B^) K(„,„) (38) 
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results in 



Tr((Y-MY)2) 

= 2 (vec(d$))^Re(divec(H)K(A/^,A/j,) Ri divec(H)) vec(d$) 



(39) 



- 2 (vec(d$))'^Re(divec(H)K(MT,MR,) R2 K(MR,A./T)divec(H*)) vec(d$). 

The validity of (|38] ) can easily be seen by noting that divec(B) 'K(m,n) vec(A) = 
divec(B) vec(A^) = vec(B o A^) = vec((B^ o A)^) = K(„,„)vec(B^ o A) = 
K(^rn,n) divec(B^) vec(A). 

Finally, employing [12, Sec. 3.7, Eq. (5)] 



to the m X n matrix A and the p x q matrix B, we can simplify the second term on the RHS 
of dm) to obtain 



It remains to turn the first term on the RHS of ([341) into the form of the RHS of ^0^. To this 
end, we start by noting that, using Wirtinger calculus, 

d^Y = dd(HH^) = d(jHH^ - jHH^) = -HH^ + HH^ + HH^ - HH^ (42) 

where we set d^H = — H o d$ o d$ = — H. Next, inserting (|42l) into the first term on the RHS 
of dSll), we get 

Tr(Y-^ d^Y) (43) 



K(p,„)(A ® B)K(„,g) = B ® A 



(40) 



Tr((Y-MY)2) 




(41) 
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(a) 



2Tr(Y-^HH^) -Tr(Y-^HH^) -Tr(Y-^HH^) 
2Tr(Y-iHH^) -Tr(Y"^HH^) -Tr((Y-i)*H* H'^) 

2 Tr (Y-^im^ - Re (Y^^HH^) ) (44) 
2Tr(Y-iHlA/^H^) - 2 Re(^Tr (H^Y-^H)^ 

2 (vec(H^))'^((Y-i)^ ® iMT)vec(H^) - 2 Re(^(vec((H^Y-i)^))^vec(H) j 

i 2 (vec(d$))'^K(A,,,M,) divec(H^) ((Y-^)^ ® Im,) divec(H^) K(a,^,Mt) vec(d$) 

- 2 Re (^vec ((H^ Y~i)^) ) ^divec(H) divec(d$) vec(d$) 

= 2 (vec(d$))^divec(H) (Ia/^ ® (y~Y) divec(H*) vec(d$) 

(45) 

- 2 (vec(d$))^Re('divec((H^Y-i)^ o H)) vec(d*) 



where (a) results from applying (l36l) with A = Y^^, B = H, C = 1^/^, and D = to 
the first term, transposing the result, and, as before, applying Tr(A'^B) = vec(A)'^vec(B) with 
A^ = H^Y^^ and B = H to the second term. Step (b) is a consequence of applying (|33l) . the 
commutation relation ([T]), and = 'K[n,rn) ■ To obtain (c), we used (|38l) and (|40l) for the 

first term and 

(vec((H^Y"i)'^))^divec(H) divec(d#) = (vec((H^Y-^)^))^divec(d$) divec(H) 

= (vec(d$))^divec((H^Y~^)^oH) 

for the second term. The final result follows by identifying (l45l) and (|4T)) with the RHS of (|30l) . 
noting that H = H for # = 0, and applying the second identification theorem. All the terms, 
except for the first term in (l45l) . can be verified to be real-symmetriqj so that (1/2) (B+B^) = B. 
The first term in ([45]) is Hermitian and hence (1/2) (B + B^) = Re(B). ■ 

*A matrix X is said to be real-symmetric if X = X'^. 
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B. Approximations for First and Second Moment of MI 

Even though the physical channel is deterministic, the effective channel H = H o exp°(j$) 
will be random due to phase noise. We shall next compute approximations of the effective 
channel's ergodic capacity C = E{/} and of Var{/} based on the second-order Taylor series 
expansion (ITTI) . For an explanation of the operational significance of Var{/} the reader is referred 
to Section EI] and to [33]. Unlike the results in Section |Vll which are restricted to the (extreme, 
but not necessarily untypical) case of fully uncorrelated phase noise, we will allow a general 
phase noise covariance matrix, i.e., $ = A/'(0, S^). Noting that J}(0)vec($) is zero-mean 
Gaussian and Q| = Q|(vec(#)) = (vec($)) 7^/(0) vec($) is a quadratic form in real- valued 
Gaussian RVs with distribution [41, Eq. (4.1.1)] 

MtMr 

1=1 

where the Xi = xli (^ = 1,2,..., MtMr) are statistically independent, straightforward manip- 
ulations reveal that 

E{/($)} = /(O) + ^ TriHiiO) S^) (46) 

and 

Var{/($)} = J,iO) ^^jfiQ) + ^Tr((7^;(0) S^)'). (47) 

Inserting (l28l) and (|29l) into (|46|) and (l47l) . we have analytic approximations of the ergodic 
capacity and the variance of the MI of the effective channel as a function of the physical channel 
and of the phase noise covariance matrix S^. The following numerical results demonstrate that 
these approximations tend to be quite accurate for full-rank physical channels but rather loose 
for rank-deficient physical channels. 

Numerical results: In Fig.lH we plot E{/($)} and (Var{/($)}) in ([461) and (HT]), re- 
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spectively, versus the exact values E{/($)} and (Var{/($)}) obtained by Monte Carlo 
simulation over 10000 phase noise samples with 3.5° rms fully uncorrelated phase noise. Each 
point in the figures represents a pair of exact and approximate MI first and second moments 
for one of 2000 realizations of an i.i.d. physical channel with Mt = Mr = 4 at p = 30 dB. 
We observe that the deviation of the first moment of MI can be positive or negative and is 
essentially independent of the exact value of the first moment. The deviation of the second 
moment is predominantly positive, and the accuracy shows a strong dependence on the exact 
value of the second moment. Quantitatively speaking, the deviation of the estimates E{/($)} 

~ 1 /2 

and (Var{/($)}) is significant for about 5% of the i.i.d. channel realizations. 

C. Taylor Series Expansion of the Channel 's Eigenvalues 

To obtain more insight into the quality-of-fit of the second-order Taylor series approximation of 
MI, we show in Fig. [5J(a)|the cdfs of the exact MI and the MI obtained through the approximation 



(1271) . In all cases, Monte Carlo simulation at p = 30 dB with 3.5° rms fully uncorrelated phase 
noise was employed. As (deterministic) physical channels we have chosen balanced (i.e., all 
nonzero singular values of the channel matrix are equal) rank-M 4x4 channels for M = 
1,2,3,4 and an unbalanced full-rank channel. All physical channels were normalized to satisfy 
||H||p = MtMr = 16. The figure shows that the Taylor series approximation is very loose 
for rank-1 and rank-2 physical channels, acceptable for the rank-3 physical channel, and very 
accurate for the two full-rank physical channels. This example shows that the second-order Taylor 
series expansion of MI tends to yield poor approximations for low-rank or, more generally, poorly 
balanced physical channels, i.e., physical channels with large eigenvalue spread. An alternative 
approach for obtaining approximations of / is to compute a second-order Taylor series expansion 
of the unordered (but continuous w.r.t. $) eigenvalues A^^''-~„($) where 

/ o HH^ ^ ' 



Mr 
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Before briefly outlining how this can be done analytically, we show in Fig. [5J(b)| the result 



of replacing (|T7l) in Fig. [5 J(a)| by the second-order Taylor series expansion of the eigenvalues 



Agg^($) used in the RHS of (|48l) . It is clearly seen that the Taylor series expansion of the 
eigenvalues -^ggnl^) yields outstanding accuracy and significantly better results than the Taylor 
series expansion of MI directly. A few comments on how to obtain analytic expressions for the 
second-order Taylor series expansion of the -^gg/zl*) ^re in order. For the sake of space and 
focus of the paper, we shall not present the details, but rather refer the interested reader to suitable 
references. In general, obtaining analytic expressions for the Taylor series of the -^ggHl^) 
difficult and tedious; in fact, in general, more tedious than computing the Taylor series expansion 
of MI directly. While the case of a physical channel with nonrepeated eigenvalues can be treated 
with relative ease by employing [12, Ch. 8, Ths. 7, 8, 10, and 11], the general case of physical 
channels that have eigenvalues of multiplicity larger than 1 (e.g., multiple eigenvalues equal 
to zero in the case of rank-deficient physical channels) is significantly more involved. Results 
relevant in this context can be found in [12, Ch. 8, Sec. 12] and in [42]. The main difficulty in 
obtaining analytic expressions in the case of repeated eigenvalues is that the results depend on 
the eigenvectors of the channel. 

VI. The Rank-1 Physical Channel 

As already mentioned in Section |IVl the impact of phase errors is more pronounced for low- 
rank physical MIMO channels. In the following, we shall therefore analyze the extreme case 
of a rank-1 physical channel in detail. In practice, deterministic rank-1 channels occur in line- 
of-sight (LOS) scenarios with small angle-spread [2] (green-field like propagation conditions). 
Stochastic rank-1 MIMO channels are channels where the realization of the MIMO channel 
matrix has rank 1 w.p.l. A prominent member of this class of channels is the pin-hole [2] 
or key-hole [3], [10], [11] channel reflecting propagation conditions with significant scattering 
close to the transmitter and the receiver and at the same time long distances between transmitter 



42 

and receiver. Finally, the following MIMO channel sounder "calibration procedure" provides a 
practical motivation for studying (and quantifying) the impact of phase errors on rank-1 physical 
channels. The main idea underlying this calibration procedure is based on the fact that connecting 
transmitter and receiver in a TDMS -based sounder by a cable results in a deterministic rank- 
1 physical channel H = al, where a G C is the gain corresponding to the constant (across 
frequency) cable transfer function. The channel sounder then acquires samples of the effective 
channel matrix which contains channel coefficients that (after power normalization) have unit 
magnitude and a phase that varies due to phase errors created by the sounder. An inspection of the 
resulting eigenvalue histogram yields the number of significant eigenvalues and the corresponding 
eigenvalue distribution. Since the underlying physical channel has rank 1, it follows that any 
additional (w.r.t. the one resulting from the physical channel) significant modes in the effective 
channel must necessarily be due to phase errors (and/or potentially other imperfections in the 
measurement equipment). This "calibration measurement" can therefore loosely be interpreted 
as revealing the highest possible rank increase due to phase errors. 

We shall see that, unlike the general case discussed in Sections |IV] and |Vl rank-1 physical 
channels allow to establish a number of insightful analytic results on the impact of phase errors on 
MI. Throughout this section, unless explicitly stated otherwise, the results are valid for a general 
(i.e., not necessarily separable) timing matrix T. We consider channels given by H = gh^ 
where the vectors g and h can be either deterministic or stochastic (and with entries that are 
not necessarily unit modulus). 

Let us start with a simple basic result which will be needed later in this section. 

Lemma 2: The Hadamard product of a rank-1 matrix H = gh^ and an arbitrary matrix 
can be written as a matrix product according to 

(gh^) o = diag(g) diag(h). 
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Proof: The elements of a rank-1 matrix H = gh-^ are given by [H]m,n = [g]m[h]„. 
Consequently, we have [Ho0]^ „ = [g]m[h]„[0]m,n- On the other hand, it follows immediately 
that [diag(g) diag(h)]„_„ = [g]„ [0]„,„ [h]„ which concludes the proof. ■ 

The following three Theorems state that a physical rank-1 channel subject to severe enough 
phase errors results in a full-rank effective channel. 

Theorem 2: For a rank-1 physical channel H = gh^ subject to phase errors with in (|20l ) 
having full rank w.p.l, we have 

det(HH^) = ( n |[g].|% 1 det(00^), Mr < Mt 
det(H^H) = n I W^T'^* det(0^0), Mr > Mt 



,i=l 



where 

Il2 ||2 ||2 ||2 

min [h] j < Si < max [h] J , min [gJJ < z/j < max [gJJ . 

i ' ' i ' ' i ' ' i ' ' 

Proof: We provide the proof for the case Mr < Mt only. The proof for Mr > Mt follows 
exactly the same line of reasoning. We start by noting that Lemma [2] implies 

det(HH^) = det(diag(g) 0diag(h) (diag(h))^0^ (diag(g))^) 

ni[g].r n^^(®di^g(h) (diag(h))^0^) 



^i=l / i=l 



^j=i / i=i 

'A/r \ AfR 

(b) / TTl r 1 |2 



,i=i / i=i 



where the second product on the RHS of (a) is taken over the Mr nonzero eigenvalues of 
(diag(h))^0^0 diag(h) only (note that the Aj are ordered as defined in the Notations section) 
and (b) follows from a Corollary to Ostrowski's Theorem [43, Corollary 4.5.11]. ■ 
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Theorem [2] thus states that a physical rank-1 channel subject to phase noise such that has 
full rank w.p.l results in a full-rank effective MIMO channel (provided that [g]m 7^ 0, Vm, 
and [h]„ 7^ 0, Vn). For a deterministic physical rank-1 channel, the resulting effective channel 
will be stochastic and will have full rank w.p.l. The condition of having full rank w.p.l may 
sound stringent. It turns out, however, that a full-rank phase noise co variance matrix Cov{$} 
is sufficient for to have full rank w.p.l. This statement can be formalized as follows. 

Lemma 3: A real Gaussian random matrix $ G M*^r^*^t where vec ($) = A/'(0, Cov{$}) 
with det(Cov{$}) > 0, has full rank w.p.l. The matrix = exp°(j$) has full rank w.p.l as 
well. 

Proof: We follow the direct proof of [44, Th. 2.3, p. 712], where it is shown that for an 
Mr X Mt random matrix X to be full rank w.p.l, it is sufficient to have the multivariate 
distribution of X be absolutely continuous w.r.t. MRMx-dimensional Lebesgue measure. This 
condition is trivially satisfied by $ with vec($) = A/'(0, Cov{$}) and det(Cov{$}) > (see, 
e.g., [45, Sec. 4.7.2]). 

The second part of the statement can be proved by using [46, Lemma 3], which states 
that has full rank if either Re(0), lm(0), or [(Re(0))^ (lm(0))^]^ has full rank. 
Direct computation reveals that the multivariate pdf of (Re(0))^ = cos°(0) is continuous 
and integrable (in the interval [—1, I^^^t^^r) so that its multivariate cdf is absolutely continuous 
w.r.t. Mr Mt -dimensional Lebesgue measure. Hence, by the direct proof of [44, Th. 2.3], Re(0) 
is full rank w.p.l, and from what was said before it follows that has full rank w.p.l. ■ 

Besides what was stated in Theorem [21 above, relating properties of Cov{$} to properties of 
det(00^) seems difficult. For = Mr, we can refine the result in Theorem[2las follows. 

Theorem 3: For a rank-1 physical channel H = gh^ with Mt = Mr = M subject to phase 
errors with having full rank w.p.l, we have 

det(HH^) = I W^r) (n |[g]«r] det(00^). (49) 
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Proof: The proof follows trivially using Lemma [21 and noting that 

det(HH^) = det(diag(g) ediag(h) (diag(h))^e^ (diag(g))^) 

= det(^(diag(g))^diag(g)) det (^0 diag(h) (diag(h))^0^) 

= det(^(diag(g))^diag(g)) det(^diag(h) (diag(h))^) det(e0^) 

which yields (gll). ■ 
The following Theorem allows a more specific conclusion since it shows that for rank-1 
channels H = gh^ where g and h consist of unit-modulus entries (representative of LOS 
propagation [2]) the rank of the effective channel matrix is equal to the rank of 0. Moreover, 
the eigenvalues of the effective channel matrix (more specifically of HH^) are equal to the 
eigenvalues of 00^. 

Theorem 4: For a rank-1 physical channel H = gh^, where g and h are such that | [g]j| = 1 
(z = 1, 2, . . . , Mr) and | [h],| =1 (z = 1, 2, . . . , Mt), we have 

A,(HH^) = A,(00^), 2 = 1, 2, . . . , Mr, Mr < Mt 
Ai(H^H) = A,(0^0), z = 1,2,...,Mt, Mr > Mt. 

Proof: The proof for both cases is trivially obtained using Lemma [2] and noting that the 
assumptions of the Theorem imply diag(g) (diag(g))^ = Imr and diag(h) (diag(h))^ = Imt- 
For Mr < Mt, simply note that 

Ai(HH^) = A,(diag(g) 0diag(h) (diag(h))''0^ (diag(g))'') 
= A,((diag(g))''diag(g)00^) 
= A,(00^), 2 = 1,2,..., Mr. 



The case Mr > Mt follows exactly the same line of reasoning. 
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Since the high-SNR MI of H (for Mr < Mt) is given hy I ^ logdet((p/MT)HH^), 
Theorems [2l and [3] immediately yield expressions^ for the high-SNR MI of H. However, the pdf 
of the quantity logdet(00^) is, in general, difficult to obtain. Insightful analytic results are, 
however, possible by invoking the assumptions of a separable timing matrix (as discussed in 
Section ITV- A3 1) and of small phase noise, i.e., exp°(j$) ^ 1 + As demonstrated previously, 
a separable timing matrix is obtained by choosing a regular sounding pattern as in (|26|) and the 
small phase noise approximation is very well satisfied in practice as the worst-case value of 7° 
rms phase noise amounts to cr^ ^ 0.0149. The assumption of a separable timing matrix implies 
that frequency offset has no impact on MI (see Section ITV-ASI) . Therefore, as a consequence of 
the two simplifying assumptions, it suffices to analyze the quantity det(00^) with = 
instead of det(00^). Interestingly, det(00^) can be characterized in terms of chi-square RVs 
and a beta-distributed RV, which provides the basis for tight bounds on C = E{/} and for 
accurate approximations of Var{/}. Before stating the corresponding results using the exact 
expression for det(00^), we shall, however, provide an approximation for det(00^) (in the 
sense of distributional equivalence), which turns out to be particularly useful to derive a simple 
analytic lower bound on C (see Theorem [8]). This approximation is based on the following result. 

Theorem 5: For a separable timing matrix T and under the small phase noise approximation 
0"^ ^ 1 so that = 1+ assuming fully uncorrelated phase noise, i.e., vec($) = 
A/'(0,apMTMR), we have 

Mr 



det(00^) ^ + MtMr) n (xif,-,.. + ^(^«)), Mr < Mt 

i=2 

det(0^0) ^ + MtMr) n (xLr_.,.. + Z{r^^^^)) , Mr > Mt 



i=2 



'More specifically an approximation in the case of Theorem |2] due to the presence of the quantities £i and Ui. 
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where the xia^ statistically independenj^ and Z{ri'-^^) = cr^(r7*^*)xf + (1 — ■r]^^'^)X2^) with 



x[^\X2^ i.i.d. as Xi i the r/*^*^ being RVs with pdf supported in the interval [0,1] V 



Proof: We provide the proof for Mr < Mt only. The case Mr > Mt follows exactly the 
same line of reasoning. Let us start by noting that the singular value decomposition of Ia/r.Mt 
is given by 1mr,Mt = 

VSW^, where V is of dimension Mr x Mr, W is Mt x Mt, and the 
Mr X Mt matrix S is given by 



VMtMr', m = n = l 

mm,n = { (51) 

0, else. 



Defining the Mr x Mt matrix S = — jS + $ with $ = V^$W (and hence $ = $), it follows 
that det(e0^) = det(SS^). With S = [si ■■■ SM^f being a square (Mr = Mt) or 
a wide matrix (AfR < Mt), a basic result in geometry (e.g., [47, Th. 7.5.1], [48, Sec. 3.2.2], 
which can be shown to hold in the complex case upon replacing transposition by conjugate 
transposition) yields 

Vdet{SS^y = vol(Ps) = lls^ll lls^ll • ■ ■ ||s|,J| (52) 

where vol(Ps) stands for the volume or MR-content of the parallelotope spanned by the Mr 
row vectors of S, s^*- = Si, and {i > 1) denotes the component of Sj orthogonal to the span 
of the vectors S]^, s^, . . . , s^j^. The orthogonal vectors s^^ (i = 2, 3, . . . , Mr) are obtained using 
Gram-Schmidt orthogonalization and are given by 

lA/.-$^f^ s, = A,s,. (53) 

n=l 11^" II / 

It is well known that applying the decomposition (|52l) to an i.i.d. complex Gaussian random 
matrix S with CJ\f{0,l) elements results in independent chi-square distributed factors Hs^p 

'"Note that the product over i on the RHS of (|50j is equal to 1 if A/r = 1 (in the case Mr < Mt) and Mt = 1 (in the 
case Mr > Mt). 
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(i = 1, 2, ... , Mr) [49, Th. 3.4 ff.]. The problem at hand differs, however, from the i.i.d. complex 
Gaussian case in two aspects, namely the fact that the elements in $ and hence $ are real-valued 
Gaussian and the presence of the deterministic component — jS. 

It follows trivially from the definition of S that ||s^p = Xmt + M^Mb.. From (|53l) we 
can see that, conditioned on s^, s^, . . . , s^^^, the vectors (i = 2, 3, ... , Mr) are JG and 
hence the ||s^p {i = 2,3, ...,Mr) are chi-square distributed. Using the fact that Sj G M*^"^ 
(z = 2, 3, . . . , Mr) and Af^Aj = A,, it follows immediately that 

||s,^||2 = sfA,s,, z = 2,3,...,Mr. 

Next, noting that 

||s,^f = sf (Re(A,)+jIm(A,))s, 

= sj Re(Ai) Si + jsj Im(Ai) s^, i = 2, 3, . . . , Mr 

has to be real- valued for all Sj, it follows that 

\\sir = srRe(AOs, = sf (im, " E ^I^tP) ^^4) 

\ n=l 11^" II / 

Based on (l54l) . we can now invoke Lemma [5] in the Appendix to conclude that the eigenvalues 
of Re(Aj) are given by 

{af } = { 0_^, 1 - r^W}, = 1, 2, . . . , Mt 

Mt-i i-2 

where rj^'^^ = ri^'^'^{sf, s^, . . . , s^^^) is a RV with pdf supported in the interval [0, 1]. Consequently, 
using [41, Eq. (4.1.1)], we obtain 

Mt 

sJ Re(A,) s^^alY: 4^ ^ = X^-..^ + + (1 - V^)^^^ 
k=i " ' 
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where the Xi = xli are independent. ■ 
We shall next show that for cr^ ^ 1, Zirj^^h ~ x? 2, which then implies that 

II -L||2 2 I 2 _ 2 /'«:<r\ 

thereby allowing an approximation of (|50l) ajll] 

Mh-1 

i=l 

Mt-1 



(56) 



In order to see that Zirj^''^) ~ x? 2 , we start by noting that the pdf of Zir]) conditional on r] is 
given by [51, Eq. (5.7)] 

1 - 2r] 



VzM = / e 4-T''(i-'') Jo ( ^ : ^ X 1 (57) 

where Io{z) is the modified Bessel function of the first kind [15, Sec. 9.6]. For cr^ small, we 
can invoke the large-|z| expansion of Io{z) [15, Eq. 9.7.1] according to 

which, when used in (|57]) . upon renormalizing so that J^QPz\ri{x) dx = 1, yields 

1 3: 
Pz|^(a;) ^ ^ _ =e ^^^(^ = p^2 ^ ^ ^ (x) 



27r(T2(i - r])x 



This means that Z\r] = Xia2(i^-q) < 77 < 1 if cr^ is small. We shall next see that ?7*^*\Vi, 
is small, in general, which then directly results in the (unconditional) pdf of Z{r]^'^^) satisfying 
Z{r]^i)) ^ xla'^,^i- Recall that {r^W, 1 - r/»} are the nonzero, nonunity eigenvalues of Re(Aj) 



"We would like to use this chance to point out that the distributional equivalence in [50, Prop. 4] should be an approximate 
equivalence (as in l l56t ). Furthermore, CAf{0, (t|>ImtMh ) in [50, Prop. 4 and Prop. 5] should be replaced by Af{0, o-|lA/xAfn )■ 
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in (|54l ). The first pair of such eigenvalues is obtained for i = 2. Due to the symmetry of the 
eigenvalues, we may investigate I — Re(A2) instead of Re(A2), which, using s;j'- = Si, can be 
written as 

I - Re(A2) = = Re(si) (Re(si))^ ^ Im(si) (Imlsi))"^ 



\ifi\\^ + \\(t\\^ 



+ 



Si 



(Tcr 



where (p = Re(si) and cr = — Im(si) = [ v^MtMr ' ■ ■ ■ 0]^. In the following, we denote 
yjj = The nonzero eigenvalues of I — Re(A2) are equal to the eigenvalues of 



given by 



(2)^ 



2 2 MtMr, + ||c^||2 



(58) 



For Mt sufficiently large, with WipW^ = ipf + ip^, where ip"^ = J2i=2 we can replace (1581) by 
Next, since is small compared to MtMr, we obtain the first-order Taylor series expansion 



{XMT-l,a2/(MTMR)) 1 XMt-1,<t2/(MtA/r)}- 



Hence, we have 



Var{r/(2)} = 2^; 



4MT-I 



which shows that, for sufficiently large Mt, Mr, r/^^^ is indeed small. For Mr x 1 and 1 x Mt 
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systems, i.e., for SIMO and multiple-input single-output (MISO) systems, respectively, we can 
therefore immediately conclude that the approximation (l56l) is very accurate. In the case of 
general Mt and Mr, it seems difficult to prove that 77^*^ ~ for i > 3. We do, however, have 
strong numerical evidence that this is, indeed, the case. 

Recalling that ^ 0.0149 for the worst-case phase noise value of 7°rms, we can conclude 
that the assumption ^ 1 made in Theorem [5] and in (|56l) is very well satisfied in practice. 
Fig. [6] shows the cdf of logdet(©0^) corresponding to the approximation (l56l) along with 
the exact cdfo (in both cases obtained through Monte Carlo methods). We observe that the 
approximation is excellent in general and, indeed, becomes better for smaller and/or for less 
symmetric (in terms of the number of transmit and receive antennas) configurations. We finally 
note that comparing (|56l) to [52, Eq. (3)] suggests that for fully uncorrected phase noise with 
(7^ ^ 1, the effective MIMO channel behaves like a physical MIMO channel consisting of a 
rank-1 Ricean component plus an i.i.d. Rayleigh fading component with the difference that in 
our case the chi-square RVs have half the order of those in [52] (reflecting the fact that here we 
are dealing with real-valued Gaussian RVs). 

We proceed by stating the result on the exact distribution of det(0©^). 

Theorem 6: For a separable timing matrix T and under the small phase noise approximation 
0"^ ^ 1 so that = 1+ assuming fully uncorrelated phase noise, i.e., vec($) = 
A/'(0,apMTMR), we have 



det(00^) 4 i^^l^^^^^, + MtMk/3(^, ^\^) j fl xk-.+i.^. < Mt 

det(0^0) 4 Ll^^^^^,^ + MtMr/?(^, ^^)] n^k-.+i,.^' > 



(59) 



Note that "exact cdf means exact under the hnear phase noise approximation. 
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where = |Mt — MrI, and the Xna^ statistically independent. 

Proof: Again, we provide the proof for Mr < Mt only. The case Mr > Mt follows 
exactly the same line of reasoning. We start by noting that the matrix S defined in the proof of 
Theorem [5] is unitarily equivalent to the matrix S' = — jS' + where the Mr x Mt matrix 
S' is given by 




VMtMr', m = n = Mr 
0, else 



and = $ = In what follows, we shall work with S' and, by slight abuse of notation, 
denote it as S. The pdf of ||sf |p follows trivially from the definition of S and is given by 
xif 2 • Applying Gram-Schmidt orthogonalization, due to the nonzero entry in S' being at 
position m = n = Mr, we can conclude that for i = 2,3, . . . , Mr — 1, the matrix Re(Aj) has 
only two distinct eigenvalues, namely with multiplicity i — 1 and 1 with multiplicity Mt— i + 
Consequently, (154)) implies that ||s^|p (i = 1,2, . . . , Mr — 1) conditioned on sf, s^, . . . , is 
distributed as Xmx-i+i ■ 'Since the eigenvalues of Re(Aj) do not depend on sf, s^, . . . , 
and the statistics of ||s^|p depend on Aj only through the eigenvalues of Aj, we can conclude 
that the unconditional distribution of lls^lP satisfies llsHP = xlr -,4-i 2 (i = 1,2, . . . , Mr — 1). 
For i = Mr, noting that A^^ is a real-valued matrix, we have 

WsiiJ'^ = ||Ajv/R,Re(sMR)||^ + IIAmrMsmr)!!^. (60) 

The distribution of the first term on the RHS of (|60l) can be shown, using the same line of 
reasoning as for i = 1, 2, . . . , Mr - 1, to satisfy || AA/j,Re(sMj,)||2 = xli^-Mn+ha^^- second 
term on the RHS of (l60l) can be expanded as 

||AMRlm(sAfj,)f = (lm(sA/j,))'^A^^^AMRlm(sA/j,) 
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(lm(sMR))^AMRlm(sMR) 



where we made use of the fact that Amr is real- valued and hence A^^^Amh, = Amr reduces 



to A^j^Amr = Amr. Next, we note that 



Amr — Ijwt ~ 



with 



= fG Kl 





Imt-Mr+1 



G K 



n=Mn 



G 



'Mr-1 



'Mr -II 



and K = [ umr Umr+i 



UMt 



and the vectors (n — Mr, Mr + 1, . . . , Mt) have to be chosen such that the matrix U = 
[G K] satisfies UU^ = GG'^ + KK^ = I. Recognizing that the vectors s,^/||s^|| (i = 
1,2,..., Mr — 1) are obtained by applying the Gram-Schmidt procedure to the real-valued 
(Mr — 1) X Mt i.i.d. Gaussian matrix [si S2 • • • smr-i]^ with zero-mean entries, we can 
take the stacked matrix U = [ G K ] to be given by the Q-matrix obtained by applying the QR- 
decomposition to an Mt x Mt i.i.d. real-valued Gaussian matrix with zero-mean entries. Note 
that using the Gram-Schmidt procedure for QR-decomposition yields the unique factorization 
characterized by positive entries on the main diagonal of the R-matrix [43, Th. 2.6.1]. Next, 
realizing that 

/ Mt \ 

||AMRlm(sMR)||^ = (im(sMR))^ ^"^^n Im(sMR) 



^n=MR 



Mt 



= MtMr Y1 [u]Lr, 



n=Mi 



R 
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the proof is complete upon deriving the pdf of XlnJMR [Uli/R.n- is well known that, applying 
any procedure for QR-decomposition leading to the unique factorization where the elements on 
the main diagonal of the R-matrix are positive, the resulting Q-matrix Q is distributed such that 
AQB = Q for any orthonormajl^ A and B [53, Th. 3.2]. Choosing A and B to be permutation 
matrices, we can conclude that the rows and columns of Q, and hence U in our case, are all 
equally distributed. Now, the quantity we are interested in is the sum of squares of the elements 
{Mr, Mr + 1, . . . , Mt} in any such row or column. Specifically, if the Gram-Schmidt procedure 
is used to obtain the QR-decomposition, the first column of U is given explicitly as Si/||si||. 
From [14, Def. 1.4] we know that the quantities [si]^/||si||^ are jointly Dirichlet distributed, i.e.. 



Partitioning s into subvectors of length Mt— Mr+1 and Mr— 1, respectively, and employing [14, 
Th. 1.4 and Th. 1.5] (reproduced as Theorem [TOl and Theorem [TTl respectively, in the Appendix 
for convenience), it follows that 



where /3(a,6) is a beta-distributed RV with parameters a and h as defined in the Notations 



Note that even though the results in (l56l) and Theorem [6] have a striking similarity and (l56l) 
provides an approximation for the exact result in Theorem [6l it seems difficult to derive (l56l) 
directly from Theorem [6l 

We are now ready to state an analytic lower bound on the ergodic capacity of an effective 
channel resulting from a rank-l physical channel with unit-modulus entries subject to fully 
uncorrelated phase noise. 

'^The matrix A is said to be orthonormal if AA^ — I. 





section. 
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Theorem 7: For a separable timing matrix T and under the small phase noise approximation 
(T^ < 1 so that = 1 + j$ with vec($) = A/'(0, ct^ImtA/r ), assuming that H = gh^ with 
|[g].| = I {i = 1,2, ...,Mr) and |[h]i| = I [i = 1,2,...,Mt), the ergodic capacity of the 
effective channel H satisfies 



e S log(l + I {£)' (?) n (^.bMk + 24e*(^)) j . > M,. 



(61) 



Proof: We provide the proof for Mr < Mt only. The case Mr > Mt follows exactly the 
same line of reasoning. We start by using Lemma|2]and noting that our assumptions imply that 

i = log det (l + -^HH") = log det (l + ^@B"\ (62) 



where S = — jS + $ was defined in the proof of Theorem [51 Next, using [35, Eq. (25)], it 
follows that 



C 



> log('l + ^ ("^y Yl e4"'^M(^^''\<'-< -<k)}\ (63) 



where (SS^)^, 

<h<...<ii denotes the submatrix of SS^ obtained by retaining the rows li < 
I2 < ... < U and the columns li < I2 < ... < The summation in (1631 ) is over all 
ordered tuples (/i, I2, ■ ■ ■ , k) chosen from the set {1,2,..., Mr}. Next, we note that the pdf 

of det((SS^);, 

<Z2< <'i) follows in a straightforward fashion from the results developed in the 
proof of Theorem[5] In particular, det ((SS^);^</2<...<«i) is the determinant of the matrix SS^ 
where the i x Mt matrix S is obtained from S by retaining the rows {li,l2, ■ ■ ■ ,k}- Distinguishing 
between the terms, in the summation over li < I2 < . . . < h on the RHS of (l63l) . that have 
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h = I and those where li > 1, we obtain 

Ejloge det((SS^),,<,,<...<0 } > E<' lege ( + MtMr) J] xli^-i,.^ ] \ (64) 



i-l 



1=1 



in the former case and 



'i-l 



E{logedet((SS^),,<,,<...<0} = E<^ loge n^M,-^,.j (65) 



J=0 



in the latter case, where (a) is obtained as follows. Recognizing that det((SS );^<i2<...<«J = 
<i2<...<ii) and applying (|56l) properly modified to account for the fact that we are 
interested in the submatrix of ©0^ obtained by retaining the rows {li < I2 < . . . < U} in 
would yield an approximate expression for the left-hand side (LHS) in (|64|) . However, using 
Theorem [5] and invoking Theorem [T2l in the Appendix, we can show that the lower bound in (|64|) 
holds firmly. Specifically, starting from (l50l) and setting X^^^ ^2 = E{loge(MTMR + Xmtcts)}' 
for brevity, we can rewrite the LHS of (l64l) as 



E{logedet((SS^),,<,,<...<0} = ^'m^,..^ + Y.^x,,xM^A^^^e{xli^^i,„i +^(^^'^))} (66) 



1=2 



= ^W,.^ + 5^Ex,E,,oE;,^,^^|;,^,,(o{loge(X, + Z(r/«))} 
1=2 

(a) * 



1=2 

Y 

i-l 



1=1 

where (a) follows from Theorem [121 in the Appendix. The relation in (|65l) is obtained in exactly 
the same fashion upon noting that the term — jV^t^r' is absent in the sets (li,l2, ■ ■ ■ ,k) 
where li > 1. The number of terms in the first group (where li = 1) is given by (^^^7^) 
whereas the number of terms in the second group is 

(^f) - • It remains to find analytic 
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expressions for the RHS of ([641) and of dMl)- It is well known [54] that E{loge(x^ ^2 )} = 
loge(2a2) + m{n/2) = X„,,2 . The term T,,^^^, = E{loge(xL^,,2 + MtMr)} has a closed-form 
analytic expression in terms of the generalized exponential integral Eiy{z) = t^'^e^^* dt, 
Re(2;) > 0. For our purposes, we shall, however, be content with a simple lower bound obtained 
by applying Jensen's inequality to the function f(x) = loge(e'^ + a), which results in 



> log,(e^^'°'^^'^«T,^^^/ +MtMr 
= loge(e^*^T.^^+MTMR). 



(67) 



Putting the pieces together, we get 



V— — 1 -V- V— vi — 1 -V- 



for dMl) and 

for (|65]) . Combining our results and noting that the term exp( ^^'^q 0^2) occurs in both 

cases so that its total number of occurences is ), finally yields 



^ gE{logedct((SS«),^<,2<...<,J} 



h<l2<...<li 



(68) 



i — 1 J \ i 

which, upon inserting into (|63l) and reorganizing terms, concludes the proof. ■ 
The result in (|6TI) can be made more explicit by using the simplifications for the digamma 
function at positive integer multiples of 1/2 given by ([3]). Furthermore, we note that Theorem |7] 
can be generalized to the cases where i) |[h]j| = 1, Vi, g is general and Mr < Mt and ii) 
|[g]i| = 1, Vz, h is general and Mr > Mp. The corresponding results are stated, without proof. 
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as 

Mr / \ n n-1 



ii) a>logh+5^(^^j K„(h)n(^5,nMR + 2aJe''i^ijj, Mr > Mt 



n=l ^ ^ ^ i=0 

where Kn{^ = Xlses ;( ) IliLi ll^l^iP' '5fc,m is the set of all possible ordered fc-tuples s = 
(si, S2, . . . , Sk) with 1 < si < S2 < . . . < Sfc < m, and /(x) is the number of elements in the 
vector X. 

Again assuming | [g]j| = 1 (i = 1, 2, . . . , Mr) and | [h]i\ = 1 (i = 1, 2, . . . , Mt), further lower- 
bounding (|6TI) by ignoring the first term inside the "log" and retaining only the highest-order 
(in p) term yields 



C*> MRlog(^^^ +log|^ f] (^5,MTMR + 2aJe*(^)^ j , Mr < Mt 
a > Mt log +log(f[ [SiM^M^ + 2aje"'(^) ^ j , Mr > Mt 



i=0 

which clearly shows that the effective channel has full rank and hence its multiplexing gain 
is given by min(MT,MR). Put differently, phase noise can cause a rank-1 physical channel to 
appear like a full-rank channel. In Section VII, we shall show, based on measurement results, 
that significant rank increase does, indeed, occur in practice. 

We shall next provide a slightly looser (than (|6T| )) lower bound on C with a simpler structure. 

Theorem 8: For a separable timing matrix T and under the small phase noise approximation 
(T^ < 1 so that = 1 + j$ with vec($) = A/'(0, cr^lAfTJ\/R)' assuming that H = gh"^ with 
|[g].| = I a = 1,2, ...,Mr) and |[h]i| = 1 (i = 1,2, ...,Mt), the ergodic capacity of the 
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effective channel H satisfies 

^- 1^ f 1 + ^ (^MtMr5, + 2aje*(^) J j , Mr > Mt. 

Proof: We provide the proof for Mr < Mt only. The case Mr > Mt follows exactly the 
same line of reasoning. We start by noting that (|64l) and (l65l) can be combined as 

i-l 

E{log,det((SS^),,<,,<...<0} > J]E{log,(MTMR5,(5,,_i + xl,,_,,.2)} 
which, upon inserting into (|63l) . yields 

The proof will be completed by showing that 2'^'^ > with 

C^i = log n f 1 + ^ ' ^ " " ' ) (70) 



and noting that the RHS of (1691 ) is obtained by lower-bounding the term corresponding to 
i = in (fTOl) according to (l67l) . Setting X„ = XM^r-n.^r^ = 0, 1, ... , Mr — 1) and Xq = 
E{loge(MTMR + Xmt,<x2)}' and expanding 

V Mt / V 
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we get 

2^1 = 1 + (^-^^ (e^° + + • • • + e^^R-i) 

2a) + f e^oe^i + e^^e^^ H h e^oe^^R-i 

2b) + + H \ h e^^R-^ e^^R-^ ) 

3a) + (^;^) (^e^oe^^e^^ + e^oe^^e^^ + h e^oe^^R-2e^^R-i (VI) 

3b) + e^^e^^e^^ H h . . . + e^^R-Se-^^R-^e^^^R-i 

+ : : 

where lines 2a) and 2b) contain (^R~^^ and (^^"2^) tenns, respectively, and lines 3a) and 3b) 
contain (*^2~^) and (^^3^) terms, respectively. Expanding 

Mr / \i ( ( ''-^ \ / ' 



i=l ^ ^ \\=lx<h<-<k \ ;=1 / \<l\<li<...<li \ l=\ 



we obtain 



2a) + f 1^ V f e^'^e^^ + e^og^^ + . . . + e^og^i 



2b) + e^°e^i + e^°e^i + ••• + ••• + e^°e^i 



3a) 



3b) + e^°e^'e^^ + e^°e^^e^^ + ■■■ + •■• + e^°e^^e^^^ 



+ : 

Mr 



s;) (»^»^*-^"«-) 
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where the number of terms in lines 2a), 2b), 3a), and 3b) is the same as in the corresponding 
lines in the expansion of 2^1. In both cases, the number of terms associated with the factor 
(p/Mt)* is given by (*^"). The proof is completed by comparing dTTI ) and (1721) term by term 
and noting that the monotonicity of the digamma function implies Xmt-h.ct^ > XMT-n-k,c7'^ and 
hence X„ > Xn+k for A; > 1. ■ 

So far we derived lower bounds on C. We shall next show that the result in Theorem [5] together 
with a technique first proposed in [35] (and used to derive the lower bound in Theorem |7]) can 
be employed to derive a tight analytic upper bound on C. 

Theorem 9: Under the assumptions in Theorem [8l the ergodic capacity of the effective MIMO 
channel can be upper-bounded as 

Proof: We provide the proof for Mr < Mt only. The case Mr > Mt follows exactly the 
same line of reasoning. The proof starts from [35, Eq. (19)] which, specialized to our case, reads 

5<log(l + ^[^') E{det((SS^),,<,,<...<0}] (74) 

\ n=l ^ h<h<...<ln J 

where S was defined in the proof of Theorem [51 The main point of the proof is to recognize 
that we can obtain analytic expressions for the terms E{det((SS^)ii</2<...</n)} using (|52l) . As 
already shown in the proof of TheoremlTl the terms in J2ii<i2< <i„ ^{'} ^^e RHS of (1741 ) 
fall into two groups depending on whether /i = 1 or /i > 1. Specifically, for /i = 1 we have 
(cf. dM])) 

n 

E{det((SS^)z,<,,<...<,J} = MTMRHElxk-,.. + ^(r/^^^)} 

n 



1=2 



62 



and for h > 1 (cf. dMl)) 

n 

E{det((SS^),,<,,<...<,„)} = E{xL„.. } HeIxm,-,.. + 

1=2 

Using ^{XM^-^,^2} = (Mt - i)al, nodng that 

= + - r^W)} = E^« {aj} 

and counting the mukiplicity of the terms as in (|68] ). we obtain 



J] E{det((SS^)z, J} 

ll<l2<.-.<ln 



Mr-1 



^ IMtMrJI ((^T-^ + 1)0-5) + ( ^'')^Ta5n((^T-^ + l)aJ) 



TiMt! , ^Mr\ Mt! 2^.. 



n y (Mt ~n)V \ n J (Mt - n)! 



Putting the pieces together, we get (1731) . which concludes the proof. ■ 
We note that the proof of Theorem [9] can alternatively be carried out by obtaining analytic 
expressions for the terms E{det((SS^)ii<z2<. ..<;„)} using properly modified versions of (l59l) . 

Numerical results: We shall next provide a numerical example that serves to quantify the 
quality of the lower bounds in Theorems |7] and [8l and the upper bound in Theorem HI For a 
4x4 deterministic physical channel H = gh^ with |[g]j| = 1, Vi, and |[h]j| = 1, Vi, subject to 
7° rms fully uncorrelated phase noise, Fig. |7] shows that the (ergodic) capacity of the effective 
channel starts deviating from the capacity of the rank-1 physical MIMO channel at p ^ 15 dB, 
and that significant capacity estimation errors (up to around 100%) occur in the high-SNR regime. 
This behavior is consistent with our observation that the low-SNR capacity is not influenced by 
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phase noise. Moreover, we observe that the lower and upper bounds (|6T] ) and (1731) . respectively, 
very accurately predict the capacity behavior of the effective channel. 

High-SNR Variance of MI: Considering an ergodic block-fading MIMO channel, it was shown 
in [33] that the (high-SNR) variance of MI can be interpreted as quantifying the amount of 
"spatial averaging" that occurs on a per-stream basis in each fading block. The smaller the 
variance of MI the more spatial averaging occurs. As shown in [33], [55], a] = Var{/} for fixed 
Mr, as a function of M^, has its maximum at Mr = Mr. For more details on the interpretation 
of a] as a measure of the amount of spatial diversity, the interested reader is referred to [33]. 
We have seen that phase noise (and frequency offset) can have a significant impact on the 
rank of the MIMO channel and hence its spatial multiplexing gain. In the following, we shall 
characterize the increase in spatial diversity due to phase noise by analyzing the variance of 
the high-SNR MI of the effective MIMO channel. Finding exact expressions for cr| = Var{/} 
seems difficult. Under the assumptions in Theorem [8l we can, however, provide accurate and 
analytically tractable approximations, which are obtained as follows. Considering, for simplicity, 
the case AfR < Mt, we can infer from (l62l) and (l56l) that in the high-SNR regime 




) 



a/r-1 



log 



Mt 



P 



+ log(MTMR + xL„.^) + E log(xL 



) 



(75) 



i=l 



Writing the second term in (fTSl) as 




and noting that for MtMr large and cr'i small, we have 
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it follows that 



,2 Afa-l 



Var{/}^Var<|log(e)^+ £ log(xk-.., 



Using 



Var|loge(^XMT-i,<x2) } = Var{loge(crJ) + loge(xl/T-i,i) } 

= Var{log.(xVJ} = v&'(^^ 

and [54, App. A. 7], we finally get 

Var{/} ^ (log(e))' — ^ + Y Y ' 2 • (76) 

Comparing (1761) to the expression [33, Eq. (31)] for the variance of the high-SNR MI of an i.i.d. 
Mr X Mt complex Gaussian channel, we can show that, for the same number of transmit and 
receive antennas, the variance of the high-SNR MI of a rank-1 physical channel as in Theorem [8] 
subject to fully uncorrelated phase noise is higher than that in the i.i.d. complex Gaussian case. 



I.e., 



9 -.4 JV/r-I . Mr 



To prove dTTj) . we omit the first term on the LHS and use Lemma |4] in the Appendix which 
leaves us with having to show that 

Mr-1 Mr-1 

2 J2 ^'(^T -i)> Yl ^'(^T ~ ^) • ^^^^ 

i=l i=0 



Subtracting the common terms on both sides, it follows that (1781) is equivalent to 

Mn-l 

^'(Mt -i)> ^'(Mt) . 

i=l 



BAUM AND BOLCSKEI; INFORMATION-THEORETIC ANALYSIS OF MIMO CHANNEL SOUNDEMG 65 

The final result follows from the monotonicity property ^'(M^ — i) > \E''(Mt), i > 1. We 
note that (TTSl) suggests that the variance of MI in the phase noise case is essentially twice that 
obtained for an i.i.d. Gaussian channel with the same number of transmit and receive antennas. 
The underlying reason lies in the fact that the individual chi-square terms in the phase noise case 
(cf. (|64|) and (l65l) ) have half the number of degrees of freedom when compared to the Gaussian 
channel case. The following numerical example corroborates the factor-2 statement motivated 
by the inequality (TTSl) . 

Numerical result: For a deterministic rank-1 physical channel H = gh^ with |[g],;| = 1, Vi, 
and |[h]j| = 1, Vz, subject to 3.5° rms fully uncorrelated phase noise, Fig. [8] shows cr| according 
to the approximation (l76l) along with the exact result (obtained from Monte Carlo simulation) 
for Mr = 10 as a function of Mt. We can see that the approximation is very tight for Mt > Mr 
and Mt < Mr and predicts the location of the maximum of cr| accurately. For comparison, 
we show erf according to the approximation [33, Eq. (31)] along with the exact result (obtained 
from Monte Carlo simulation) for an i.i.d. Rayleigh fading channel with the same number of 
transmit and receive antennas. It is clearly seen that both types of channels exhibit similar MI 
variance behavior as a function of Mt, and that a| ^ 2a|. 

VII. Measurements 

In this section, we provide results from measurements taken with a commercially employed 
TDMS-based MIMO channel sounder by applying the rank-1 "calibration procedure" discussed 
in the beginning of Section |Vll Before elaborating on the measurement results and comparing 
them with our analytic results, we shall verify assumptions on the system model and on the 
phase error characteristics that were made throughout the paper. 

A. Description of the Measurement Setup 

In the following, a MIMO channel snapshot indexed by the superscript with / = 1, 2, . . . , L 
consists of one snapshot of each of the MtMr scalar subchannels. Since the physical channel 
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is Static (i.e., the cable transfer function is static), time-alignment, as discussed in Section ITl-Ci 
is not needed. The scalar subchannel snapshots in the /th MIMO snapshot are taken at times tf' 
(/c = 1, 2, . . . , MtMr). All measurement results in this section are based on L = 1100 MIMO 
snapshots. 

Parameters specifying the channel sounder employed and the measurement setup are provided 
in Table H The sounder is based on circular convolution and uses a regular sounding pattern 
according to (|26|) . implying that frequency offset has no impact on MI. As a consequence of 
(|26|) . the SISO snapshot distance is given by Tr. The sounder inserts a "dummy" receive antenna 
(i.e., one unused SISO snapshot) to accomodate switching between transmit antennas, so that 
Tt = (Mr + 1)Tr. Note that (|26l) implies that the SISO snapshot corresponding to the dummy 
receive antenna is simply omitted. The sounding sequence results from a periodically extended 
m-sequence of length (chips) with period NT, where 1/T is the chip rate. 

The phase noise process, in general, has a distinctive low-pass characteristic. Increasing the 
SISO snapshot time distances, and thereby reducing the rate at which the continuous-time phase 
noise process is effectively sampled, results in stronger decorrelation of the time-discrete phase 
noise process underlying the effective MIMO channel. This, in turn, leads to increased error in 
estimating the MI. On the other hand, the minimum SISO snapshot time distance is determined by 
the antenna switching speed, the duty cycle (as explained below) and, in particular, the sounding 
sequence length. Specifically, reducing the sounding sequence length leads to a degradation of the 
sequence-correlation properties and hence a reduction in sequence SNR, which in turn implies 
lower measurement SNR. In summary, there is a tradeoff between the sequence SNR, determined 
by the time-discrete sounding sequence, in particular by its length, and the MI estimation error 
due to decorrelation of the phase noise process underlying the effective MIMO channel. To 
further understand this tradeoff, we performed measurements based on two sounder setups, as 
defined in Table H with different sequence lengths. 

We would like to point out that the various sounder settings (including sounding sequence 
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length) can usually not be chosen independently due to hardware limitations of the sounder. 
One such typical limitation is the overall duty cycle, i.e., the ratio i] = MtM^NT /Tmimo of 
sounding time (where the received signal is recorded and processed) to total measurement time, 
where Tmimo is the MIMO snapshot distance. Channel sounders typically employ a small overall 
duty-cycle to limit the real-time signal processing and data-storage requirements. The overall 
duty-cycle can be separated into one within and one between MIMO snapshot periods given by 
r/intra = MtMrNT / {tfj^^,^ - 1? + NT) and r^inter = (t^M^-tf^ +NT)/TMmo, respectively, 
with r] = r^intra^^inter- In general, r^intor has no influence on the estimated MI as the correlation 
properties of the channel and/or phase noise across MIMO snapshot periods do not play a role in 
our considerations. If r] is fixed, it is therefore preferable to have r^intcr small and r^intra large. In 
the two setups considered here, this is indeed the case, with r/inter = 0.1431 and r^intra = 0.4697 
for the length-511 sequence, and r^mter = 0.05 and r/intra = 0.3193 for the length-31 sequence. 

The transfer function of the cable used to connect transmitter and receiver is flat (recall that 
we are performing the rank-1 calibration procedure decribed in Section IVTl) over the frequency 
range of interest. In the sounder under consideration, the overall (i.e., effective) channel induced 
by s{t) and r(t) along with the cable exhibits, however, some delay spread. This is mainly due to 
overs ampling of the signal at the output of the receive frontend filter by a factor of 2 (relative to 
the chip rate). We estimated the power-delay profile (PDP) of the resulting effective channel and 
identified the position of its peak. For further processing we used only the signal corresponding 
to the peak of the effective channel's PDP. 

B. Verifying Assumptions on Phase Error Characteristics and System Model 

The purpose of this section is to investigate the general phase error characteristics of the 
sounder under consideration, to verify our assumptions on the phase error statistics stated in 
Section iri-B2l and to verify the system model assumptions stated in Section ITl-BSi Correspond- 
ingly, the following discussion is organized into three parts. 
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1) General phase error characteristics: We start by giving an impression of the phase varia- 
tion characteristics over different time horizons. Fig. |9 |(a)| shows the long-term behavior (multiple 



seconds) of the phase-unwrapped raw channel estimates o Each physical SISO channel measured 
corresponds to the cable connecting transmitter and receiver. Since the cable exhibits a frequency- 
flat and constant (over time) transfer function, any variation (over time) in the measured channel 
must necessarily come from channel sounder nonidealities, or more specifically from phase 
noise and frequency offset in the transmit and receive LOs. It is clearly seen that the unwrapped 
phase contains a linear component resulting from the carrier frequency offset between transmitter 



and receiver. Fig.|9 J(b)| shows the trace of the residual phase obtained by removing the linear 
component from the overall observed phase. This linear component was estimatej^ to correspond 
to a relative (w.r.t. the carrier frequency of 5.25 GHz) frequency offset of 7.2 ■ 10^^^, indicating 
excellent performance in terms of carrier frequency accuracy. Moreover, as already pointed out, 
since we are employing a regular sounding pattern according to (|26|) . a carrier frequency offset 
does not have impact on MI. Taking a closer look at the phase residual in Fig.|9 J(b)l we can 
see that it exhibits two different constituents: a constant "thickness" corresponding to essentially 
uncorrelated (on the time scale used) random fluctuations and a component of comparatively 
larger amplitude and higher temporal correlation containing at times abrupt changes (see, e.g.. 



the area marked in Fig.|9 |(b)| ). These abrupt phase changes may be caused by spontaneous phase 
jumps of the reference oscillators, smoothed out by the PLL. If the PLL bandwidth is small 
compared to the MIMO measurement rate (i.e., the rate at which MIMO snapshots are taken), 
we can, however, neglect the impact of the abrupt phase changes. 

Fig. [To] shows the phase noise trace (i.e., the overall phase after removal of the estimated linear 

'*In Figs.[9]and[T0l the time axis corresponds to the sequence of scalar subchannel measurements, with a total of MtMrL 
(with L = 1100), taken according to the regular sounding pattern l l26t . 

"Note that the low duty-cycle between MIMO snapshots increases the possibility of the phase containing jumps larger than 27r 
between consecutive MIMO snapshots, thereby resulting in erroneous unwrapping and hence carrier frequency offset estimation 
errors. Estimation was performed through least-squares fitting. 
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component) during one MIMO snapshot period taken witlo length-31 and length-511 sounding 
sequences, respectively. Each dot in the two figures represents the phase of one scalar subchannel 
measurement (note the different absolute time scales in Figs. |10|(a)| and |10|(b)[). In accordance 



with what was said earlier, one can immediately see that the phase trace corresponding to the 
length-31 sequence shows significantly higher correlation between successive SISO snapshots 
than that for the length-511 sequence (which can essentially be considered i.i.d.). While the 
shorter (length-31) sequence thus is clearly preferable regarding the phase noise properties, 
it may fail to yield sufficiently high sequence and consequently measurement SNR (see the 
discussion in Section IVII-AI) . The corresponding estimated rms standard deviation a^p (estimated 
by averaging the standard deviation per MIMO snapshot over all L = 1100 MIMO snapshots) 
was found to be 3.9° for the length-31 sequence and 3.8° for the length-511 sequence. These 
values agree very well with the 3.5° rms value used as "typical" case throughout the paper. 

2) Verification of assumptions on phase error statistics: We start by showing, in Fig. [TH the 
cdf of the measured MIMO snapshot phase noise traces (i.e., the overall phase after removal 
of the estimated linear component), normalized to zero-mean, and plotted on a scale where 
Gaussian distributions show a linear behavior. The normalization of the mean was performed 
by computing and subtracting the empirical mean on a MIMO snapshot by MIMO snapshot 
basis. It is clearly seen that both for the length-31 and the length-511 sounding sequence an 
excellent match with an A/'(0, (3.9°)^) distribution is obtained. This allows us to conclude that 
the assumption of a Gaussian phase noise process (see Section iri-B2|) is well justified. 

The main differentiating factor between the phase noise models discussed in Section iri-B2l 
is stationarity. It therefore remains to verify the stationarity assumption (on the time-scale of 
one MIMO snapshot) made throughout the paper. In particular, we need to show that the phase 
noise sequence ipn resulting from samples taken at the time instants nTji exhibits stationary 

""Note that the length of the sounding sequences is in fact SIT and 511T, respectively. 
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behavior. This can be done by examining the phase differences a{k,n) = (^k — Vk+n, or more 
specifically the variance a'^{k,n) = ¥,{a'^{k,n)} (note that E,{a{k,n)} = 0). In the case of 
stationary phase noise, we have a'^{k,n) = cr^(n) = 2(T^(1 — r<^<^(n)), where r^^{n) denotes the 
phase noise autocorrelation function. Assuming that the stationary process has limited memory, 
i.e., |r^i^(r;,)| for \n\ — »• oo, we have lim^^oo Cal'T^) = 2cr^. On the other hand, if the 
nonstationary phase noise model, described in Section iri-B2[ would be applicable, we would 
have al{k,n) = cr^(n) oc \n\ [23, Lemma 8.2]. 

Fig. [I2] shows the estimate A/(3"^(n)/2' for the length-31 and the length-511 sounding sequence, 
where o-^(ri) is obtained as follows. The measured phase noise sequence (fk corresponds to the 
time instants tk. Consequently, (pk is the result of sampling at a rate of 1 /Tr, omitting, as already 
pointed out, the samples corresponding to the dummy receive antenna. Finally, E{a^(A;,n)} is 
estimated by averaging a'^{k,n) in each MIMO snapshot, over all indices k and n satisfying 
tk+n — tk = nTji with k G [1, MtMr] and k n E [1, MtMr], and then averaging the results 
over all snapshot periods / G [I, L]- We can clearly see that, for both sampling sequence lengths, 
a/ cr'^{n)/2' levels out at a constant value, which allows us to conclude that over the time frame 
of interest (i.e., one MIMO snapshot period) the assumption of stationary phase noise is well 
justified. Moreover, we can observe that the large-n limit of ^y(r'^{n)/2' is close to 4°, which is 
very well in accordance with the 3.8° and 3.9° rms values estimated earlier. 

3) Verification of the system model assumptions: Based on our measurement results, we shall 
next quantify the impact of phase errors on sequence SNR and discuss the validity of the 
assumptions (fT8l) and (fT9l) . 

While we have seen in Section iri-B3l that it is safe to assume the absence of peak-shifting 
in the sounder under consideration (recall that the sounder employs m-sequences), phase errors 
will, in general, lead to a reduction in sequence SNR and hence also in measurement SNR. 
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Defining 



SNRi 



c'(zT)P 



2' 



I = 



l,2,...,iV-2 



we estimate (Monte Carlo) the histogram of the overall SNR, defined through its histogram 
to be given by the average of the histograms corresponding to the quantities SNRj; 10000 
phase noise realizations per lag were used. In the absence of phase noise an m-sequence has a 
(deterministic) sequence SNR of 201ogio(m), which, for m = 511, equals 54.2 dB. The phase 
noise in the sounder under consideration randomizes this value to a mean of 53.8 dB and a level 
below 49.1 dB w.p. 0.1%. We will see later that 49.1 dB lies above the sounder's SNR caused 
by thermal noise. Consequently, the effect of phase noise on sequence SNR, albeit noticeable, 
can be considered insignificant. 

Condition (fTSi) is verified as follows. We start by noting that, for a periodically extended m- 
sequence and circular convolution in the receiver, as employed in the sounder under consideration, 
the integration interval in ([l8]) has length \3p\ = NT. With Auj = 27rA/ = 2.4rad/s, NT = 
310 ns for the length-31 sounding sequence, and NT = 5110 ns for the length-511 sounding 
sequence, we have \Auj^\ < 12.3 • lO^^rad, which implies that we can safely assume that 
ex-p(jAuj^) ^ 1. If the inverse of the bandwidth of (fR^t) and (^T^t) is much larger than NT 
and/or the variance of (pR{t) and (pT{t) is small, the term exp( — j[(pR{tk + ~ V^t(4 — 
rj:^ +0]) will remain essentially constant over an interval of length NT thereby, together with 
exp(jAc<j^) ^ 1, resulting in 



which confirms (fTSi) . From Fig. [I2l we can see that the phase noise process' bandwidth is small 
enough for phase noise to sustain a correlation coefficient of at least 0.75 over the duration of 
a length-511 sounding sequence. In addition the phase noise variance is very small as well. 




r<'))d« « / 



*(e-rrode =1 



(79) 



72 

Condition (fT9l ) requires that the inverse of the coherence bandwidth of the phase noise process 
(fTit) is large compared to the maximum multipath delay difference. From Fig. [I2l we can 
conclude that for the sounder under consideration this is the case only for multipath delay 
differences of up to 5 |j.s, i.e., only for indoor or small-cell cellular outdoor propagation scenarios. 
Consequently, assumption (fT9l) needs to be carefully verified on a case by case basis for outdoor 
measurments. 

Finally, even though only relevant in the case of linear convolution-based sounders, we would 
like to comment on the condition AcijrP ^ 0,V/c,/. We start by noting that typical values for 
maximum multipath delay differences range from 1 |j.s for indoor environments, over 10 |a.s for 
outdoor small-cell cellular systems (with a few kilometers cell size), to 100 |j.s for a propagation 
range of 30 km encountered in outdoor large-cell cellular systems. With Au = 2.4rad/s, this 
corresponds to Aoorj!^ ranging from 2.4 ■ 10^^ rad to 2.4 ■ 10"'' rad. We can therefore conclude 
that the condition Aurf^ ^ is well satisfied in practice. 

Other imperfections: Finally, before quantifying the impact of phase errors on estimated 
MI in our calibration measurement, we need to convince ourselves that the errors observed are, 
indeed, phase errors and are not caused by other measurement imperfections. This can be done 
as follows. We start by noting that, in polar coordinates, phase errors are visible only in the 
argument of the effective channel and not in the magnitude. Assuming that all additional error 
sources can be subsumed as additive noise n which is circularly symmetric distributed, we can 
measure the SNR in the magnitude direction which, due to 

is simply the SNR of h + n. By circular symmetry of the additive noise term, the result of this 
measurement provides us with the noise caused by other error sources in the direction orthogonal 
to the magnitude direction which, by the small phase noise assumption is the phase direction. 
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This leads to an SNR level (in phase direction) of below 37.5 dB w.p. 1% for the length-511 
sequence. Compared to the corresponding phase noise SNR level of below 15.1 dB w.p. 1%, we 
can conclude that phase noise is, indeed, the dominating error source. 

C. Estimated MI for a Rank-1 Physical Channel 

Next, we illustrate the impact of phase errors on estimated MI by performing the rank-1 
"calibration procedure" described in the first paragraph of Section |Vll For brevity, we restrict 
ourselves to the length-511 sounding sequence (where phase noise is essentially fully uncorre- 
lated). 

For an SNR of p = 30 dB, Fig. I13|(a)| shows the MI obtained from the measurements along 



with the corresponding MI of the rank-1 physical channel H = 1. To illustrate the impact 
of additive noise on the channel coefficients, we also show the capacity of a rank-1 physical 
channel subject to additive white Gaussian noise (AWGN) on the channel coefficients, i.e., 
vec(H) = CA/'(q;1, cr^I) where a, real-valued and positive, and cr^ were estimated (using the 



moment-matching method [56]) from the measurement resulto on a MIMO snapshot by MIMO 
snapshot basis. The corresponding per- subchannel SNR is hence given by SNR = /a%. While 
additive errors on the channel coefficients do not impact the capacity significantly, we can see 
that the MI of the effective channel is more than 200% higher than the MI of the underlying 
physical channel. This significant measurement error can alternatively be quantified in terms of 
the MIMO channel's number of spatial degrees of freedom, using the definition provided in [8] 
as 

"^free = Ip — Ip/2- 

Fig. I13|(b")] shows mfree corresponding to Fig. ll3|(a)| We can see that the presence of phase 



noise increases mfree from 1 for the rank-1 physical channel to 8-9 for the measured channel. 

''The entries of the effective MIMO channel are, in general, not unit-modulus (due to, e.g., additive thermal noise). 
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An effective full-rank channel, i.e., a channel where all eigenvalues are significant at the given 
SNR, would correspond to mfy-cc = 16. Moreover, in Fig. [14] we show the ergodic capacity 
corresponding to the measurement results in Fig.[T3j i.e., averaged over L = 1100 MIMO 
snapshots at each SNR, along with the ergodic capacity predicted by our analytic results in 
Theorems |7] and [9] for 3.8° rms fully uncorrelated phase noise. We can see that the analytic lower 
and upper bounds are slightly higher than the measured ergodic capacity. This is probably due 
to the residual (very low since we are using length-511 sounding sequences) correlation in the 
phase noise process, neglected in the analytic results. Apart from this effect, the measurement 
results exhibit an excellent match with the analytic results. 

VIIL Conclusions 

We showed that phase errors (caused by phase noise and carrier frequency offset) in time- 
division multiplexed switching (TDMS)-based multiple-input multiple-output (MIMO) radio 
channel sounders can alter the channel statistics significantly and thereby lead to severe mutual 
information (MI) and capacity estimation errors. The impact of phase errors is most pronounced 
for low-rank physical channels where overestimation by several hundred percent can occur. 
A detailed analysis of the rank-1 physical MIMO channel revealed that realistic phase noise 
properties lead to a decorrelation of the channel matrix and result in a full-rank effective channel. 
Our analytic results are supported by measurements conducted with a commercially employed 
TDMS -based MIMO channel sounder. In the particular measurement setup considered, phase 
noise turned a physical rank-1 channel into an effective rank-8 channel. In general, for a given 
sounder, the impact of phase errors will be more pronounced for a larger number of antennas 
and for larger SISO snapshot time distances. Our analysis furthermore demonstrated that the MI 
measurement error induced by phase errors can depend significantly on the order in which the 
individual subchannels of the overall MIMO channel matrix are measured. 

Based on matrix differential calculus and matrix-variate Wirtinger calculus, we characterized 
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the sensitivity of MI w.r.t. phase errors. 

The presence of a carrier frequency offset between transmitter and receiver was found to be less 
problematic as it does not impact the MI of the physical channel under quite general conditions 
on the sounding pattern. Moreover, the impact of a carrier frequency offset is of deterministic 
nature and can be compensated by estimating the offset and subsequently removing it. 

In the light of the main findings of this paper, the results obtained through MIMO channel 
measurement campaigns using TDMS-based MIMO channel sounders should be interpreted with 
great care. To the best of our knowledge, the large majority of commercially available MIMO 
channel sounders is TDMS-based. In particular, measurement results reporting the absence of 
pin-hole or key -hole channels [2], [3], seem questionable unless a channel sounder with separate 
radio frequency (RF) chains for the individual transmit and/or receive antenna elements is used. 

Simple averaging (w.r.t. phase noise realizations) of MI is not a viable way to mitigate the 
effect of phase errors as MI is a nonlinear function of the phase noise samples and of the physical 
channel realization. Averaging of entries of the effective channel matrix over different MIMO 
snapshots is not a viable option either as it leads to a significant reduction in measurement SNR; 
this is due to the phase offsets between MIMO snapshots essentially being arbitrary. While this 
problem could, in principle, be mitigated through an increased MIMO measurement rate, the 
latter is often limited by requirements on the minimum sequence length, maximum duty cycle, 
and minimum switching times. 

A mitigation method based on taking multiple measurements per scalar subchannel at a higher 
rate with the goal of improving the phase stability across antennas and, ideally, emulating one- 
sided switching, is described, along with the corresponding tradeoffs, in [57]. This method yields 
excellent results in the case of small min(MT, Mr). 

The safest, but most expensive, solution to avoid measurement errors due to phase errors is 
to employ a channel sounder with separate RF chains for the individual transmit and/or receive 
antenna elements. We hasten to add, however, that such an architecture has other problems 
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associated with it, such as gain/phase imbalances and crosstalk between the different RF chains. 
Nevertheless, at least in theory, as shown in Section ITV- A3 [ if either the transmit antenna array 
or the receive antenna array employs a separate RF chain for each antenna element and TDMS 
is used on the other side of the link, MI is not affected by phase errors. If such a sounder 
architecture cannot be realized, the local oscillators (LOs) should be selected very carefully 
or even custom-designed which can be very expensive. As a last resort, we recommend to 
synchronize the LOs at transmitter and receiver through a cable. This solution, however, makes 
outdoor measurements difficult and limits the flexibility of the sounding device, in particular 
w.r.t. the distance between transmitter and receiver. 

Appendix A 

Theorem 10 (Fang, Kotz, and Ng [14, Th. 1.4]): Let x = Aj(a) be partitioned into k sub- 
vectors x.^^\ x*^^\ . . . , x*^'^^ and a into the corresponding subvectors a*^^), aS'^\ . . . , a'^'^^. Let yi and 
bi be, respectively, the sums of the components of x*^*) and a*^*\ and set z*^*) = x*^'y?/j. The 
following statements hold: 

i) The vectors z*^*\ z*^*"''^^ . . . , z^^\ y = [vi Vi ■ ■ ■ VkY statistically independent. 

ii) y is distributed as Dk{bi, &2, • • • , bk). 

iii) z*^*) is distributed as (a*^*)), i = 1,2, . . . , k, where Ui is the number of elements in the 
subvectors a*^*) and z^^\ 

From Theorem [To] it follows immediately that the vectors x*^'^ are Dirichlet (or beta) distributed. 
This result can be stated formally as follows. 

Theorem 11 (Fang, Kotz, and Ng [14, Th. 1.5]): If [xi X2 ■■■ Xn-i] = -D„_i(ai, 02, . . . , 
a„_i; a„), then for any k < n — 1, we have [xi x^ ■ ■ ■ a^fc] = Dk{ai, 02, . . . , a^; 6), where 

b = afc+i + afc+2H l-«n- In particular, for any i (i = 1,2, . . . ,n — l), we have Xj = f3{ai,a — ai) 

with a = Y,"^^! «j- 
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Lemma 4: The first derivative '^'{z) = d'^{z)/dz of the digamma function "^{z) defined in 
^ satisfies 



Proof: Using we have 



\2 



2J j^^{p + iy ^(2p + a;)2 (2 + x)^ (4 + x) 

X + 1 



OO ^ OO 



4 



V ^ V ^ I = 4 ^'(x) - ^' 



which upon noting that 



-r I fX\ ^ , / X + 1 



completes the proof. 

Lemma 5: The eigenvalues of the matrix 



n=l 



Re(A,) = Im, - J2 ^1^1 * = 2, 3, . . . , Mr 



with Aj and s:^ defined in (|53l) and s.„ defined through S = [si S2 ... smr = — + $ 
with S as in ([SB, are given by (A; = 1, 2, ... , Mp) 

{af } = { 0_^, r/«, 1 - r^W} 

where r/W = rj'-'^sj;, s^, . . . , s^^_J G [0, 1]. 
Proof: We start by writing 

^-^4 Re(s;^) (Re(s;^))^ ^ Im(s,i) (lm(s;^))^ 
Ke(Ai) = Imt - 2^ u-^r2 2^ 



n=l ll "11 n=l II "II 
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where 



Gi = [Re(Ni) Im(Ni)] with 



l^l"ll ll^2"ll ll^i^lll J 



By slight abuse of notation, in the remainder of this proof, we let Afc(X) denote the unordered 
eigenvalues of the matrix X. It follows that a^^ = 1 — Ajt(GjGf ), k = 1,2,..., Mp. Invoking 
Lemma [6l we can conclude that the 2{i — 1) eigenvalues Afc(Gf Gj) are given by 



1 1 rTrT 1 1 / (i^ ' 1 1 / (i\ 1 1 



2 + 2 Vf i" . ^ - ^ Vf i" . ■ ■ ■ . ^ + ^ Vfl'^i . -2 - ^ Vfffi I m 

with /i^*^ G M (fc = 1, 2, . . . , 2 — 1). Therefore, when paired properly, the Afc(GfGj) pairwise 
add up to 1 . This property will next allow us to show that Gf G^ has i — 2 eigenvalues equal to 
I, i — 2 eigenvalues equal to 0, and one pair of eigenvalues given by {r]^''\ 1 — r/*^*)}. We start 
by noting that, using ([53]), it follows for z = 2, 3, . . . , Mr that (recall that Sj G M^^^) 



is-ii/ — \K\\' 



^ Im(s^) (Re(s ^))^- Re(s^) (im(s^))^ 



llc-L||2 llc-L|| 
„=1 ll^n II ll^i II 

i-1 

= ^(lm(s^)en + Re(s^)Cn) (81) 

n=l 

^ f 4 \ (j ^ M^^) {M^^)f ^ Ms^) {Ms^)y 

\ iPi II / \ iPn II „_i IPn II 



n.=l " " " n=l 

i-1 



5^(Re(s^)en + Im(s^)Cn) (82) 



n=l 



with ^„ G M. The significance of ([8TI) and (l82l) is that it can be used to show that r(Gj) = 
i. More specifically, starting with i = 2, we note that G2 = [Re(si/||si||) Im(si/||si ||) ] 
has rank 2 w.p.l as will be shown first. Noting that Re(si) = A/'(0, cr^I) and Im(si) = 
[-MtMr, Oi,a/t-i]^' it follows that ||si|| > w.p.l and hence r(G2) = r([Re(si) Im(si)]). 
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By definition, Re(si) and Im(si) are linearly independent w.p.l and hence ^(62) = 2 w.p.l. Now 
with each increase in i, two vectors are added to Gj, where one, namely Im(s^^/||s^]^||) by (|8TI) . 
is a linear combination of the vectors already in Gj, and the other one, namely Re(s^]^/||s^ J) 
by (|82l) . is a linear combination of the vectors already in Gj and the vector Sj_i/||s^^||. We can 
therefore conclude that the 2(z — 1) x 2(z — 1) matrix Gf Gj has at most i nonzero eigenvalues. The 
remaining i — 2 eigenvalues are equal to zero. By (|80l) we must therefore have i — 2 eigenvalues 
which are equal to 1. Since we have 2i — 2 eigenvalues in total, it follows that there is one 
pair of eigenvalues of the form {r]^^\ 1 — 77*^*^}. Finally, noting that i]^'-^ and 1 — 77*^*^ must be 
real- valued and positive, we can conclude that r/^*) G [0, 1]. ■ 
Lemma 6: Given an orthonormal set of vectors xi,X2,...,x„ G with n < N, the 
eigenvalues of the 2n x 2n matrix Y^Y with 



Y = [Re(X) Im(X)] 



where 



X = [ xi X2 



are given by 




2 ' 2 ' 2 



(83) 




where A = X^XX^X*. 



Proof: We start by noting that 
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satisfies 







1 1 


\ 


Y' = Y 















u 



wliere U is unitary. We can therefore conclude that 













[X x*]j 


v 







A,(U^Y^YU) = Ai(Y^Y). 



Next, we note that X^X = X^X* = I„ due to the orthonormality of the x„, and hence the 
eigenvalues of Y^Y are given by the solutions of the characteristic equation 



p(/i) = det 



V 



;i-2/i)i x^x* 

X^X (1 - 2/i)I 
For a Hermitian matrix S partitioned according to 

A B 
B* C 

where A and C are square matrices, we have from the Schur complement formula [43, Sec. 0.8.5] 

det(S) = det(A) det(C - B*A-iB). 



It therefore follows that 



p{n) = det((l - 2/i)2l - X'^XX^X*). 



(84) 



The solution of (f84l) yields the eigenvalues of Y^Y as (|83l) . which completes the proof. ■ 
Note that if n > N/2, Y must have 2n — N eigenvalues equal to and therefore by (|83l) also 
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2n — N eigenvalues equal to 1 so that ([83] ) can be refined as 
i + v/MaT 1- v^MAy 1 + 1- v/A^ 



2 ' 2 



1 + v/A^„„(A)- 1- v/A^^„(A)- ^ ...,o| 



2n-7V 2n-Ar 



Theorem 12: Given the conditional (on r]) RV^j ^'(r/) = XicrZr^ + Xicr2(i-r))' where the two 
chi-square distributed terms are independent, and a constant r G we have 

E{log(r + Z(0))} <E{log(r + Z(r7))}, < r/ < 1. (85) 

Proof: We start by noting that Z{ri) = r/) for < r] < 1 so that it suffices to prove (f85l) 
for < r/ < 1/2. We shall also need the properties Z{0) = Z{1) = x\^2 and Z{l/2) = x\^2 j^- 
Noting that 

log(r + Z{rj)) = log(r) + log(l + Z{r])/r) 
= log(r) + log(l + 

where Z{r]) = xj^a^r, + Xi,s2(i-r;) ^^'^ = (^^Z^' it follows that proving ([85]) for r = 1 is 
sufficient. 

The cdf of Y{^) = x?,^ is [15, Eq. 26.4.19, Eq. 6.5.16, Eq. 6.1.8] 



Fn?)(l/)=erf(^^|j (86) 

where erf(x) = {2/^/^F) J^^ exp(— t^) dt denotes the error function. The inverse function cor- 
responding to ([861) is given by F^^^-|(x) = 2^erfi^(x), where erfi(x) denotes the inverse error 
function. Using the inverse method of generating random deviates [15, Sec. 26.8], i.e., for a 

'^'For the sake of simplicity of notation, we committed an abuse of notation iiere, as in tiie main body (cf. Theorem|5} we 
used the symbol Z{ri) to denote the unconditional RV Z{ri) — (T^(r;Xi + (1 — ri)X2). 
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uniformly distributed RV U G [0, 1] we have X15 = together with the independence 

of the two terms in Z(ri) = x? 2„ + X? 2n_r,v we can now express Z(ri) in terms of two 
independent uniformly distributed RVs f/i G [0, 1] and t/2 G [0, 1] as 

Z(r7) = 2^277 erfi2(f/i) + 2(t2(1 - r]) ei&{U2). (87) 

To prove the Theorem, it suffices to show that 

|^E{log(l + Z(r/))} >0, 0<r/<l/2. (88) 



Inserting ([87]) in ([88]), we get 

d_ 
dr] 



-E{log(l + Z(r/))} = — / / log (1 + 2(7277 erfi2(ui) + 2(T2(l-r/)erfi2(u2))duidM2 

c"? Jo Jo 



(a) 



1 /.I 



2a2(erfi2(Mi) -erfi2(M2)) 



log(6) / / 1,09 H?7 — N , o 9/1 \ — H?7 — rdnidM2 



JO 
•1 /.I 



1 + 2a'^ri eT&{ui) + 2cr^(l — rj) ei&{u2) 



= log(e) / / /i(Mi,U2)/2(Mi,M2) dMidM2 (89) 

Jo Jo 

where in (a) we interchanged expectation and differentiation (noting that the integrand is con- 
tinuous for all T] G [0, 1]), and we set 

/i(mi,M2) = erfi2(Mi) - erR'^{u2) 

/2(wi, U2) = (l/(2a2) + r/erfi2(Mi) + (1 - r/) eiR^iu^))'' . 

Note that /i(mi,M2) is negative symmetric, i.e., /i(ui,U2) = — /i(m2,mi)- Furthermore, the 
monotonicity of erfi(u) implies that for ui > U2 we have fi{ui,U2) > 0, and for ui < U2 it 
holds that fi{ui,U2) < 0. We will now exploit these properties of fi{ui,U2) to complete the 
proof and start by rewriting the integral in (l89l) according to 



/ / fl{Ul,U2)f2{Ul,U2)dUidU2 
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(/ fi{ui,v)f2{ui,v)dui+ fi{v,U2)f2{v,U2)du2] dv 

V=0 \J Ul=V J U2=V / 

I h{u,v){f2{u,v)- f2{v,u))dudv (90) 

ti=0 J u=v 

where fi(u,v) > in the entire range of integration since u > v. Finally, we will show that 
f2{u,v) — f2{v,u) > 0, which by (|90l) then implies (f88l) . Straightforward manipulations reveal 
that the condition f2{u,v) — f2{v,u) > is equivalent to 

?7erfi^(M) + (1 — r/) erfi^(f ) < ?7erfi^(f ) + (1 — r^) erfi^(M) 
{2r] - 1) erfi2(M) < (2r/ - 1) ei&{v) 

and hence by < < 1/2 to 

erfi^('u) > erfi^(t') 

which is satisfied for m > f (i.e., over the entire range of integration) because erfi(x) is 
nondecreasing. ■ 
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Fig. 1. Architecture of a TDMS-based MIMO channel sounder. ADC, DAC, MIX, MPU, BPF, and CLK stand for analog-to- 
digital converter, digital-to-analog converter, mixer, multiplexing unit, bandpass filter, and (reference) clock, respectively. 




Fig. 2. Ergodic capacity of an 8 x 8 effective channel for fully uncorrelated phase noise with varying rms values and various 
r(R) of the underlying physical channel. 
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MI [bit/s/Hz] 



Fig. 3. cdf of mutual information at SNR p = 20 dB showing the impact of the loss of joint Gaussianity for a 4 x 4 physical 
channel with [Corr(H)]m,n = c, Vm 7^ ti, and fully uncorrelated 7° rms phase noise. 



TABLE I 

Channel Sounder and (Two Different) Measurement Setup Parameters 



Carrier frequency 
Frequency generation 
Sounding sequence 
Sounding pattern 
Chip rate 


5.25 GHz 
Rubidium reference and PLL-VCO 
bipolar m-sequence, rectangular chip pulses 
regular (see Section HV-A3I) 
100 MHz 


Sequence length (chips) 


511 


31 


MIMO configuration (Mr x Mp) 


16 X 16 


23x23 


MIMO snapshot distance Tmimo 


19.46 ms 


10.27 ms 


Receive-side SISO snapshot distance Tr 


10.22 |j,s 


0.93 M-S 


Ratio Tt/Tr 


17 


24 


SISO snapshot duration \3p\ 


5.11 M-s 


0.31 M-s 
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(a) First moment of I and of second order Taylor series expansion I 



0.7 



1.5 r 



0.5 




0^ ' ' 1 
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(Var{/})i/2 

(b) Second moment of / and of second order Taylor series expansion I 

Fig. 4. Moments of mutual information / and of mutual information second order Taylor series expansion J, botli for 2000 
realizations of an i.i.d. Gaussian physical channel subject to fully uncorrelated 3.5° rms phase noise at SNR p = 30 dB. Moments 
of / are obtained from Monte Carlo simulation over 10 000 phase noise samples per physical channel reaUzation. 
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(a) cdfs corresponding to the second-order Taylor series approximation of mutual information 
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(b) cdfs corresponding to second-order Taylor series approximation of channel eigenvalues 

Fig. 5. cdfs of mutual information and of corresponding approximation based on second-order Taylor series expansion ( |27b . 
as well as second-order Taylor series approximation of eigenvalues, for realizations of a (from left to right) rank-1, balanced 
rank-2, balanced rank-3, rank-4, and balanced rank-4 physical 4x4 channel subject to fully uncorrelated 3.5° rms phase noise 
(10000 realizations) at SNR p = 30 dB. For all channels, we employed the normalization ||H||p — MtMh- 
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logdet(00^) 



Fig. 6. cdf of logdet(00^) and analytic approximation l l56l l, for various Mt,Mr, and different phase noise levels. For 
comparison, logdet(©©^) is also shown. A regular sounding pattern according to i26t was assumed. All results are based on 
300000 Monte Carlo runs. 




Fig. 7. Ergodic capacity of a 4 x 4 rank-1 physical channel with unit-modulus entries subject to 7° rms fully uncorrelated 
phase noise. Exact results are obtained through Monte Carlo simulation. 
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Fig. 8. Exact variance of mutual information and analytic approximation ( I76t at high SNR for Mr = 10 as a function of 
Mt for a rank-1 physical channel with unit-modulus entries subject to 3.5° rms fully uncorrelated phase noise, and for an i.i.d. 
Rayleigh fading channel. Exact results are obtained through Monte Carlo simulation. 
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Fig. 9. Long-term phase noise behavior of a commercially employed TDMS-based MIMO channel sounder (length-31 sounding 
sequence). 
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(b) Phase-noise trace of a single MIMO snapshot (length-511 sounding sequence) 
Fig. 10. Short-term phase noise behavior of a commercially employed TDMS-based MIMO channel sounder. 
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Fig. 11. cdf of the measured MIMO snapshot phase noise traces (i.e., overall phase after removal of estimated linear component) 
compared to Af{0, (3.9°)^) reference distribution. 
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Fig. 12. •\/<3'S {n)/2' for length-31 and length-511 sounding sequences. 
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(a) Mutual information for measured channel and for reference rank-1 physical channel 
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(b) Number of degrees of freedom mfrcc corresponding to results in (a) 

Fig. 13. Mutual information and corresponding number of degrees of freedom mfrco at SNR p = 30 dB for Mt = A/r — 16 
configuration of Table |l] and a rank-1 "calibration measurement" conducted on a commercially employed TDMS-based MIMO 
channel sounder. For reference, we also show the mutual information corresponding to the rank-1 physical channel H = 1 
and the mutual information corresponding to a rank-1 physical channel subject to AWGN on the channel coefficients (see 
Section lVira . 
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